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(54) DOCUMENT FILE RETRIEVAL DEVICE AND MACHINE READABLE RECORDING MEDIUM 
RECORDING PROGRAM 

(57)Abstract: 

PROBLEM TO BE SOLVED: To realize a retrieval inquiry about 
a WWW home page by a natural language. 
SOLUTION: A WWW home page being a retrieval object 
document file is described in an XML. When a retrieval 
condition composition is inputted, a keyword extraction part 4 
converts a natural language expression expressing an attribute 
name into an attribute name index including the attribute name 
and also converts the natural language expression expressing 
the attribute value into an attribute value index including a pair 
of the said attribute name and attribute value. A keyword filter 
part 5 deletes the attribute name index existing at a place 
where the attribute name and the attribute value of the same 
attribute exist adjacent to each other in a converted index 
string. A document contents check part 6 checks whether or not 
a tag corresponding to pairs of attribute name and value of the all attribute value index exists in the 
retrieval object document file. If the said tag exists, a document contents output part 9 retrieves and 
outputs the attribute value of the tag having the relevant attribute name of the attribute name index 
that is included in the converted index string. 
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[B5] 



<!--X*tvud 7*074 -;i/©7r-f;u--> < ctUi^ > hfr 

<?xml version="1.0", encoding=" shift jis"?> 

< ! DOCTYPE mi&mtfnyj -;USYSTEM"http://report/hotel. xml" > 
<7 e o7-f -;HBH> 

< *fvu> x*f- ;!/</* r7l/> 

< W> If > 7 7 > =j </±§F/t> 

$150</{IIS> 

<7°D7-{ —A>> 
</7°D7-f -;b> 

</7°P7-r -;HE^> 



<!-Z*x;KD7 5 n7^ -;l/©7 7-f > <*= Ctltea^ > hfr 
<?xml version="l. 0", encoding="shif t j is"?> 

<! DOCTYPE ?t>£fiifH7°o7 4 -;PSYSTBM"http://report/hotel. xml* > 

<7°o7^ -;b|S^> 

< * -r ;b > Z * -f ;k /* f- )]/ > 

<i3r>->7h;K/M> 

<tt»>$180</«»> 

<^-y h >^Dj</-<>y h > 
<7 > P7-f — Jl> 

</-?nyj -jl> 

• • * 

</-?n7 4-A4Egi> 
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♦NOTICES*^ 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original precisely. 
2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 

CLAIMS 

[Claim(s)] t . 

[Claim 1] In the text file retrieval equipment with which the text file which contains the pair of the attribute name and the 
attribute value of an attribute of an attribute is used as the text file for retrieval, and the part which suits the retrieval 
conditions which the user specified with natural language is searched from the text file for retrieval Investigate the 
retrieval demand sentence expressed with natural language sequentially from a head, and the attribute name is outputted 
as an attribute name index to the natural language expression expressing an attribute name. The keyword extraction 
section which performs outputting the pair of the attribute value and attribute name as an attribute value index to the 
natural language expression expressing attribute value one by one, Only when the output of said keyword extraction 
section is inputted, it investigates sequentially from a head and the attribute name index and attribute value index of the 
same attribute exist in next doors The keyword filter section which deletes said attribute name index and outputs the 
other part as it is, It investigates whether the pair of the attribute name of all attribute value indexes and attribute value 
which were outputted from said keyword filter section into the text file for retrieval is built in. Text file retrieval 
equipment equipped with a retrieval means to search and output the attribute value corresponding to the attribute name of 
the attribute name index outputted from said keyword filter section from the text file for retrieval when built. 
[Claim 2] The attribute name storing dictionary which stores the pair with the natural language expression expressing an 
attribute name and its attribute name about the attribute name which exists in the text file for retrieval, It has the attribute 
value storing dictionary which stores 3 groups with the natural language expression which expresses the attribute name 
corresponding to attribute value and its attribute value, and its attribute value about the attribute value which exists in the 
text file for retrieval. Said keyword extraction section investigates the retrieval demand sentence expressed with natural 
language sequentially from a head, and refers to an attribute name storing dictionary. If the natural language expression 
expressing an attribute name is included, will output the attribute name which is the natural language expression and pair 
as an attribute name index, and an attribute value storing dictionary will be referred to. Text file retrieval equipment 
according to claim 1 characterized by having the configuration which outputs a pair of set with the natural language 
expression, the attribute value which is 3 groups, and an attribute name as an attribute value index if the natural language 
expression expressing attribute value is included. 

[Claim 3-] The text file which contains the pair of the tag which the attribute name expressing the semantics written into 
the document attached, and the attribute value of the attribute is used as the text file for retrieval. In the text file retrieval 
equipment with which the part which suits the retrieval conditions which the user specified with natural language from 
the text file for retrieval is searched The attribute name storing dictionary which stores a pair of set with the natural 
language expression expressing an attribute name and its attribute name about the attribute name which exists in the text 
file for retrieval, The attribute value storing dictionary which stores the set of 3 groups with the natural language 
expression which expresses the attribute name corresponding to attribute value and its attribute value, and its attribute 
value about the attribute value which exists in the text file for retrieval, Investigate the retrieval demand sentence 
expressed with natural language sequentially from a head, and an attribute name storing dictionary is referred to. If the 
natural language expression expressing an attribute name is included, will output the attribute name which is the natural 
language exoression and pair as an attribute name index, and an attribute value storing dictionary will be referred to. The 
keyword extraction section which will output the pair of the natural language expression, the attribute value which is 3 
groups and an attribute name as an attribute value index if the natural language expression expressing attribute value is 
included Only when the output of the keyword extraction section is inputted, it investigates sequentially from a head and 
the attribute name index and attribute value index of the same attribute exist in next doors The keyword filter section 
which deletes said attribute name index and outputs the other part as it is, It investigates whether the pair of the tag 
corresponding to the pair of the attribute name of all attribute value indexes and attribute value which were outputted 
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from said keyword filter section into the text file for retrieval is built in. Text file retrieval equipment equipped with a 
retrieval means to search and output the attribute value of the tag which has the attribute name of the attribute name 
index outputted from said keyword filter section when built from the text file for retrieval. 

[Claim 4] From the set of the text file to build [ two or more ] in, the pair of the tag which the attribute name expressing 
the semantics written into the document attached, and the value of the attribute In the text file retrieval equipment which 
chooses the text file with which are satisfied of the retrieval conditions which the user specified with natural language, 
and displays the suiting part The text file term document which stores the identifier and existence location of all the text 
files used as the candidate for retrieval, The attribute name storing dictionary which stores a pair of set with the natural 
language expression expressing an attribute name and its attribute name about the attribute name which exists in the text 
file used as the candidate for retrieval, The attribute value storing dictionary which stores the set of 3 groups with the 
natural language expression which expresses the attribute name corresponding to attribute value and its attribute value, 
and its attribute value about the attribute value which exists in the text file used as the candidate for retrieval, If a user 
inputs the retrieval demand sentence expressed with natural language, he will investigate said input statement 
sequentially from a head, and will refer to an attribute name storing dictionary. If the natural language expression 
expressing an attribute name is included, will output the attribute name which is the natural language expression and pair 
as an attribute name index, and an attribute value storing dictionary will be referred to. The keyword extraction section 
which performs one by one outputting a pair of set with the natural language expression, the attribute value which is 3 
groups, and an attribute name as an attribute value index if the natural language expression expressing attribute value is 
included, Only when the output of the keyword extraction section is inputted, it investigates sequentially from a head and 
the attribute name index and attribute value index of the same attribute exist in next door through The keyword filter 
section which deletes said attribute name index and outputs the other part as it is, When the contents and the attribute 
value index of a text file are inputted, in the contents of said text file Investigate whether a tag including the attribute 
name in said attribute value index exists, and when it exists Take out the attribute value which exists by the tag and pair, 
and whether the value is equal to the attribute value in said attribute value index investigation and when equal With the 
contents Banking Inspection Department of a document which outputs success, and does a rejected output when that is 
not right If the contents of the text file and one or more attribute value indexes are inputted Take out one at a time from 
said attribute value index, and the contents and said every one taken-out attribute value index of said text file are passed 
to the contents Banking Inspection Department of a document. To all attribute value indexes, when the output is success 
With the contents Banking Inspection Department of an integrated document which outputs success, and outputs a 
rejection when that is not right Every one contents of the text file are taken out with reference to said text file term 
document. The contents of said document, and the part of the attribute value index of the outputs of the keyword filter 
section to the contents Banking Inspection Department of an integrated document Delivery, The success text file name 
sorting section to which it performs receiving the output of said contents Banking Inspection Department of an integrated 
document to said all text files that it took out one [ at a time ], and said output outputs only the identifier of the text file 
of success, If the contents of a text file name and said text file name and the attribute name index which is the output of 
the keyword filter section are inputted When one of said attribute name indexes is taken out, and it investigates whether a 
tag including said taken-out attribute name exists and exists in the contents of said given text file with the contents output 
section of a document which said inputted attribute name index is alike, respectively, receives, and performs displaying 
the value and said inputted text file name of a tag of the attribute name on a user, and outputting nothing in not existing 
Input the set of the text file name which is the output of said success text file name sorting section, and a text file name 
storing dictionary is referred to. Text file retrieval equipment characterized by having the contents output-control section 
of a document which repeats performing taking out said every one inputted element of a set of a text file name, and 
passing the contents output section of a document to all the text file names under said input. 

[Claim 5] The text file which contains the pair of the attribute name and the attribute value of an attribute of an attribute 
is used as the text file for retrieval. The computer which constitutes the text file retrieval equipment with which the part 
which suits the retrieval conditions which the user specified with natural language is searched from a text file for 
retrieval Investigate the retrieval demand sentence expressed with natural language sequentially from a head, and the 
attribute name is outputted as an attribute name index to the natural language expression expressing an attribute name. 
The keyword extraction section which performs outputting the pair of the attribute value and attribute name as an 
attribute value index to the natural language expression expressing attribute value one by one, Only when the output of 
said keyword extraction section is inputted, it investigates sequentially from a head and the attribute name index and 
attribute value index of the same attribute exist in next doors The keyword filter section which deletes said attribute 
name index and outputs the other part as it is, It investigates whether the pair of the attribute name of all attribute value 
indexes and attribute value which were outputted from said keyword filter section into the text file for retrieval is built 
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in. The record medium which recorded the program which considers as a retrieval means to search and output the 
attribute value corresponding to the attribute name of the attribute name index outputted from said keyword filter section 
from the text file for retrieval when built, and is operated and in which machine reading is possible. 
[Claim 6] The text file which contains the pair of the tag which the attribute name expressing the semantics written into 
the document attached, and the attribute value of the attribute is used as the text file for retrieval. The computer which 
constitutes the text file retrieval equipment with which the part which suits the retrieval conditions which the user 
specified with natural language is searched from a text file for retrieval The attribute name storing dictionary which 
stores a pair of set with the natural language expression expressing an attribute name and its attribute name about the 
attribute name which exists in the text file for retrieval, The attribute value storing dictionary which stores the set of 3 
groups with the natural language expression which expresses the attribute name corresponding to attribute value and its 
attribute value, and its attribute value about the attribute value which exists in the text file for retrieval, Investigate the 
retrieval demand sentence expressed with natural language sequentially from a head, and an attribute name storing 
dictionary is referred to. If the natural language expression expressing an attribute name is included, will output the 
attribute name which is the natural language expression and pair as an attribute name index, and an attribute value 
storing dictionary will be referred to. The keyword extraction section which will output the pair of the natural language 
expression, the attribute value which is 3 groups, and an attribute name as an attribute value index if the natural language 
expression expressing attribute value is included, Only when the output of the keyword extraction section is inputted, it 
investigates sequentially from a head and the attribute name index and attribute value index of the same attribute exist in 
next doors The keyword filter section which deletes said attribute name index and outputs the other part as it is, It 
investigates whether the pair of the tag corresponding to the pair of the attribute name of all attribute value indexes and 
attribute value which were outputted from said keyword filter section into the text file for retrieval is built in. The record 
medium which recorded the program which considers as a retrieval means to search and output the attribute value of the 
tag which has the attribute name of the attribute name index outputted from said keyword filter section when built from 
the text file for retrieval, and is operated and in which machine reading is possible. 

[Claim 7] From the set of the text file to build [ two or more ] in, the pair of the tag which the attribute name expressing 
the semantics written into the document attached, and the value of the attribute The computer which constitutes the text 
file retrieval equipment which chooses the text file with which are satisfied of the retrieval conditions which the user 
specified with natural language, and displays the suiting part The text file term document which stores the identifier and 
existence location of all the text files used as the candidate for retrieval, The attribute name storing dictionary which 
stores a pair of set with the natural language expression expressing an attribute name and its attribute name about the 
attribute name which exists in the text file used as the candidate for retrieval, The attribute value storing dictionary which 
stores the set of 3 groups with the natural language expression which expresses the attribute name corresponding to 
attribute value and its attribute value, and its attribute value about the attribute value which exists in the text file used as 
the candidate for retrieval, If a user inputs the retrieval demand sentence expressed with natural language, he will 
investigate said input statement sequentially from a head, and will refer to an attribute name storing dictionary. If the 
natural language expression expressing an attribute name is included, will output the attribute name which is the natural 
language expression and pair as an attribute name index, and an attribute value storing dictionary will be referred to. The 
keyword extraction section which performs one by one outputting a pair of set with the natural language expression, the 
attribute value which is 3 groups, and an attribute name as an attribute value index if the natural language expression 
expressing attribute value is included, Only when the output of the keyword extraction section is inputted, it investigates 
sequentially from a head and the attribute name index and attribute value index of the same attribute exist in next door 
through Delete said attribute name index, and if the keyword filter section, and the contents and the attribute value index 
of a text file outputted as it is are inputted, the other part When it investigates whether a tag including the attribute name 
in said attribute value index exists and exists in the contents of said text file Take out the attribute value which exists by 
the tag and pair, and whether the value is equal to the attribute value in said attribute value index investigation and when 
equal If success is outputted, and the contents Banking Inspection Department of a document, and the contents of the text 
file and one or more attribute value indexes which carry out a rejected output are inputted when that is not right Take out 
one at a time from said attribute value index, and the contents and said every one taken-out attribute value index of said 
text file are passed to the contents Banking Inspection Department of a document. To all attribute value indexes, when 
the output is success The contents Banking Inspection Department of an integrated document which outputs success, and 
outputs a rejection when that is not right, and said text file term document are referred to. Every one contents of the text 
file are taken out. The contents of said document, and the part of the attribute value index of the outputs of the keyword 
filter section to the contents Banking Inspection Department of an integrated document Delivery, It performs receiving 
the output of said contents Banking Inspection Department of an integrated document to said all text files that it took out 
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one [ at a time ]. If the attribute name index said whose output is an output of the success text file name sorting section 
which outputs only the identifier of the text file of success, and the contents of a text file name and said text file name 
and the keyword filter section is inputted When one of said attribute name indexes is taken out, and it investigates 
whether a tag including said taken-out attribute name exists and exists in the contents of said given text file The value 
and said inputted text file name of a tag of the attribute name are displayed on a user, the contents output section of a 
document which said inputted attribute name index is alike, respectively, receives, and performs outputting nothing in 
not existing Input the set of the text file name which is the output of said success text file name sorting section, and a 
text file name storing dictionary is referred to. The record medium which recorded the program which makes it the 
contents output-control section of a document to repeat to perform taking out said every one inputted element of a set of 
a text file name, and passing the contents output section of a document to all the text file names under said input, and is 
operated and in which machine reading is possible. 



[Translation done.] 
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* NOTICES * 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original precisely. 

2 **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 

DETAILED DESCRIPTION 

[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] Especially this invention relates to the text file retrieval equipment which made the retrieval 

inquiry by natural language possible about text file retrieval equipment. 

[0002] 

[Description of the Prior Art] When it is going to make a precision express a retrieval intention of a user more in 
information retrieval generally, the method of making it express with natural language like Japanese or English is 
effective. The system which performs retrieval to a database with natural language already exists, and is called the 
natural language interface (bibliography: DIBEROPPINGUANACHURARURANGEJI interface two complex data and 
G . G . "Developing a Natural Language Interface to Complex Data" besides Hendrix, ACM Trans.on Database Systems, 
1978.). 

[0003] The conventional natural language interface interprets the retrieval inquiry by a user's natural language, changes 
the inquiry into the retrieval type of the query language (SQL) of a database, and shows a user the retrieval result to 
which the retrieval type was returned by database system from delivery and database system. 

[0004] However, although commercialization already started and the conventional natural language interface left for 20 
years or more, ithas not reached the level of practical use yet. Since the functor in which a natural language interface 
system cannot interpret a free inquiry of a user, but the system permits one of the reason of the, and a vocabulary are 
limited clearly, a user is for having to memorize what kind of expression can be used after all. Therefore, a natural 
language interface is not different from a complicated command system. That is, it was a problem to say that the 
conventional natural language interface cannot receive a user's free expression (bibliography : 

DIBEROPPINGUANACHURARURANGEJI interface two complex data and G . G . "Developing a Natural Language 
Interface to Complex Data" besides Hendrix, ACM Trans.on Database Systems, 1978.). It was especially a problem that 
it is not allowed although he wants to use not the natural language with a perfect user about an easy question but a brief 
expression. As an example of a brief expression, there are a part of expression by the keyword train, nongrammatical 
expression or natural language sentence, and **. 

[0005] Then, these people proposed the "natural language interpretation approach" of realizing retrieval by the natural 
language of a brief expression, in previous patent application (JP,5-67136,A). This makes applicable to retrieval the table 
of the database which is the assembly of the group of the attribute name and the attribute value of an attribute of an 
attribute. Each word in the inquiry sentence by natural language is classified into an attribute name, attribute value, and 
others. Save the word classified into the attribute name as a response attribute name, and the word classified into 
attribute value makes a group the attribute value and the attribute name corresponding to it, and it saves it as condition 
attribute value ****. When all the groups of the attribute value in this saved condition attribute value **** and an 
attribute name exist in a table, the attribute value in said table corresponding to each attribute name saved as said 
response attribute name is outputted as a reply to an inquiry. In addition, there is the "natural language interpretation 
approach" which starts JP,5-242147,A for which these people applied previously too as a conventional technique similar 
to this. 

[0006] On the other hand, it follows on expansion of use of World Wide Web (WWW) in recently, and is WWW. The 
importance of the upper retrieval technique is increasing. WWW A user is WWW. The typical tool used when carrying 
out information retrieval in a top is a search engine. As this example, Altavista, Infoseek, Lycos, etc. are famous. 
However, in a search engine, since it is the format of the retrieval which combines a keyword, the intention of retrieval 
of a user is not reflected directly in many cases. For example, a price is 150 when looking for the information about a 
hotel. It is possible to take a pet in a dollar, and it is impossible to express such a retrieval intention only along a keyword 
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to find the homepage of the hotel which is moreover near San Francisco. Temporarily, it is (1). It expresses by the 
formula. 150 A dollar, San Francisco, a hotel, and a pet are possible. - If (1) and a keyword are put in order, the list of 
very a lot of homepages will be outputted. 

[0007] A natural language interface system is introduced and it is WWW. If a homepage can be searched, precise 
retrieval conditions like the upper example will be expressed obediently, and only a suitable homepage can be searched. 
However, WWW The example which applied the natural language interface to retrieval of the upper homepage is not 
found. 

[0008] in addition, WWW although there be "information retrieval equipment" indicated by JP,10-40262,A as other 
examples of the retrieval technique over a homepage, since this aim at the information retrieval suitable for the 
sensibility of the user who do not have the retrieval conditions for [ clear ] retrieval by enable retrieval which made 
sensibility expression data the keyword, it be fit for retrieval by precise retrieval conditions like the upper example. 
[0009] 

[Problem(s) to be Solved by the Invention] Although the technique indicated by JP,5-67136,A mentioned above can 
serve as an effective means when putting a natural language interface system in practical use since it can interpret the 
inquiry by natural language by the simple approach, the technical problem which should still be solved is left behind. 
Since it is surely treated as a response attribute name and is included during the reply to an inquiry when a certain 
attribute name exists in the inquiry sentence by natural language, it is that a reply may become redundancy. 
[0010] For example, a table with "an attribute name = title, its attribute value = No Longer Human, an attribute name = 
author, and its attribute value = Dazaiosamu" is received. When the inquiry "the author of title = No Longer Human is ?" 
is performed, first Subsequently, "No Longer Human" is judged to be attribute value, a group with the "title" which is 
attribute value "No Longer Human" and its attribute name is saved as condition attribute value ****, a "title" is judged to 
be an attribute name, and it is saved as a response attribute name, and he is saved [ an "author" is judged to be an 
attribute name and ] as a response attribute name. And said table which has the attribute value in condition attribute 
value **** "No Longer Human" and a group with an attribute name "a title" is searched, and the attribute value "No 
Longer Human" and "Dazaiosamu" corresponding to a response attribute name "a title" and an "author" are searched and 
outputted from the table. That is, the part and reply which are outputting "No Longer Human" are redundancy. 
[001 1] Then, the purpose of this invention is to abolish the redundancy of the reply to the retrieval inquiry by natural 
language as much as possible. 

[0012] Moreover, other purposes of this invention are WWW. It is in making the retrieval inquiry by natural language 

possible also to a homepage. 

[0013] 

[Means for Solving the Problem] (1) In the technique indicated by JP,5-67136,A in which the 1st carried out invention 
****, although the user asked the attribute value corresponding to it and the reason the reply is redundancy has described 
it in a sentence when an attribute name exists in the inquiry sentence by natural language, it is because it is treating as a 
response attribute name uniformly. Then, when the attribute name and attribute value of the same attribute appear in next 
doors, he is trying not to include the attribute name in a response attribute name in this invention paying attention to the 
point in the inclination which a user adjoins them mutually and is described when describing a certain attribute name and 
the attribute value corresponding to it, in order to specify retrieval conditions in the inquiry sentence by natural language. 
The text file which contains the pair of the attribute name and the attribute value of an attribute of an attribute is more 
specifically used as the text file for retrieval. In the text file retrieval equipment with which the part which suits the 
retrieval conditions which the user specified with natural language from the text file for retrieval is searched Investigate 
the retrieval demand sentence expressed with natural language sequentially from a head, and the attribute name is 
outputted as an attribute name index to the natural language expression expressing an attribute name. The keyword 
extraction section which performs outputting the pair of the attribute value and attribute name as an attribute value index 
to the natural language expression expressing attribute value one by one, Only when the output of said keyword 
extraction section is inputted, it investigates sequentially from a head and the attribute name index and attribute value 
index of the same attribute exist in next doors The keyword filter section which deletes said attribute name index and 
outputs the other part as it is, It investigates whether the pair of the attribute name of all attribute value indexes and 
attribute value which were outputted from said keyword filter section into the text file for retrieval is built in. When built, 
it has a retrieval means to search and output the attribute value corresponding to the attribute name of the attribute name 
index outputted from said keyword filter section from the text file for retrieval 

[0014] Furthermore, in order to enable it to judge correctly which natural language expression which natural language 
expression in a retrieval demand sentence expresses an attribute name, and is expressing attribute value The attribute 
name storing dictionary which stores the pair with the natural language expression expressing an attribute name and its 
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attribute name about the attribute name which exists in the text file for retrieval, It has the attribute value storing 
dictionary which stores 3 groups with the natural language expression which expresses the attribute name corresponding 
to attribute value and its attribute value, and its attribute value about the attribute value which exists in the text file for 
retrieval. Said keyword extraction section investigates the retrieval demand sentence expressed with natural language 
sequentially from a head, and refers to an attribute name storing dictionary. If the natural language expression expressing 
an attribute name is included, will output the attribute name which is the natural language expression and pair as an 
attribute name index, and an attribute value storing dictionary will be referred to. If the natural language expression 
expressing attribute value is included, it has the configuration which outputs a pair of set with the natural language 
expression, the attribute value which is 3 groups, and an attribute name as an attribute value index. 
[0015] Thus, if it is in the text file retrieval equipment of constituted this invention When a user inputs the retrieval 
demand sentence expressed with natural language, first the keyword extraction section Investigate a retrieval demand 
sentence sequentially from a head, and the attribute name is outputted as an attribute name index to the natural language 
expression expressing an attribute name. To the natural language expression expressing attribute value, the pair of the 
attribute value and attribute name is outputted as an attribute value index. Only when the keyword filter section inputs 
the output of the keyword extraction section, and investigates sequentially from a head and the attribute name index and 
attribute value index of the same attribute exist in next doors, an attribute name index is deleted. Subsequently, 
subsequently It investigates whether the pair of the attribute name of all the attribute value indexes with which the 
retrieval means was outputted from the keyword filter section into the text file for retrieval, and attribute value is built in. 
When built, it has prevented the reply to a user becoming redundancy by searching and outputting the attribute value 
corresponding to the attribute name of the attribute name index outputted from the keyword filter section from the text 
file for retrieval. 

[0016] (2) 2nd invention WWW For applying a natural language interface to retrieval of the upper homepage, the 
difficult reason is WWW. The contents of the homepage are the files which consist of the texts and drawings which were 
drawn with natural language, and it is not being the set of an attribute name and attribute value like a database. That is, 
the target database [ interface system / which is the former / natural language ] is supposing there is a database of the 
hotel which comes out in the example of the term of a Prior art since it was the set of an attribute name and attribute 
value (identifier: X hotel, price:$150, pet:possibility of, location:San Francisco), 
(Identifier: Y hotel, price:$200, petimproper, location:Los Angeles) 
(Identifier: Z hotel, price:$180, petimproper, location:Seattle) 

** — it stores with a gestalt [ like ] ~ having — ****-- an inquiry of a user — (2) it expresses by the formula - as ~ "-- 
price =$150 and pet = — possible and location = San Francisco" 

~ (2) It was [ like ] convertible. This is SQL. It is changed into language as it is. However, WWW Since the information 
on such [ usually ] an attribute name and attribute value is not contained, a homepage is SQL. It is not convertible for the 
retrieval type which can be expressed by the formula. That is, the conventional WWW Homepage creation language is 
HTML (bibliography: the homepage of a World-Wide- Web consortium, URL http://www.w3.org). In HTML, the group 
of the attribute name for expressing the configuration and attribute value is built in in the text file. For example, it is the 
example of an HTML file which is shown in drawing 6 . here — < — what it is the attribute tag which was surrounded by 
> and used independently (example : <B>) Initiation tag (example : <TR>) Termination tag (example : </TR>) There are 
some which are used by the pair. The description of a HTML tag is limited to it defining the expression of the 
appearance in a text file. For example, the tag made a table-expression is <TABLE> and the tag showing line feed is 
expressed by <P>. It is WWW about such an HTML file. If it is made to read into a browser, it will become a gestalt as 
shown in drawin g 7 , and will be indicated by the output at a user. However, HTML cannot define the tag for expressing 
the semantics in a document. 

[0017] So, at this invention, it is WWW. The group of the attribute name and attribute value expressing the semantics in 
the text file is made to build in a text file. XML (Extensible Markup Language) specifically extended so that the contents 
of the document could be expressed by a pair of set of an attribute tag and its attribute value in a file describes a 
document (bibliography: the homepage of a World-Wide-Web consortium, "EKUSUTENSHIBURU markup language 
1.0 M http://www. w3.org/TR/PR-xml -971208). . XML WWW The specification was proposed by the World- Wide- Web 
consortium (bibliography: the homepage of a World-Wide- Web consortium, URL http://www.w3.org) which is the 
engine which decides a criterion in December, 1997. XML By the described document, a machine becomes legible about 
the contents of the document and a content addressed retrieval becomes possible. Then, that is used in this invention and 
it is WWW. The retrieval inquiry by natural language is realized to a homepage. 

[0018] The text file which contains the pair of the tag which the attribute name which specifically expresses the 
semantics written into the document attached, and the attribute value of the attribute is used as the text file for retrieval. 
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In the text file retrieval equipment with which the part which suits the retrieval conditions which the user specified with 
natural language from the text file for retrieval is searched The attribute name storing dictionary which stores a pair of 
set with the natural language expression expressing an attribute name and its attribute name about the attribute name 
which exists in the text file for retrieval, The attribute value storing dictionary which stores the set of 3 groups with the 
natural language expression which expresses the attribute name corresponding to attribute value and its attribute value, 
and its attribute value about the attribute value which exists in the text file for retrieval, Investigate the retrieval demand 
sentence expressed with natural language sequentially from a head, and an attribute name storing dictionary is referred 
to. If the natural language expression expressing an attribute name is included, will output the attribute name which is the 
natural language expression and pair as an attribute name index, and an attribute value storing dictionary will be referred 
to. The keyword extraction section which will output a pair of set with the natural language expression, the attribute 
value which is 3 groups, and an attribute name as an attribute value index if the natural language expression expressing 
attribute value is included, Only when the output of the keyword extraction section is inputted, it investigates 
sequentially from a head and the attribute name index and attribute value index of the same attribute exist in next doors 
The keyword filter section which deletes said attribute name index and outputs the other part as it is, It investigates 
whether the tag corresponding to the pair of the attribute name of all attribute value indexes and attribute value which 
were outputted from said keyword filter section into the text file for retrieval is built in. When built, it has a retrieval 
means to search and output the attribute value of a tag with the attribute name of the attribute name index outputted from 
said keyword filter section from the text file for retrieval. 

[0019] Thus, if it is in the text file retrieval equipment of constituted this invention When a user inputs the retrieval 
demand sentence specified with natural language, the keyword extraction section If the natural language expression 
which investigates a retrieval demand sentence sequentially from a head, and expresses an attribute name is included The 
attribute name is outputted as an attribute name index, and if the natural language expression expressing attribute value is 
included, a pair of set with the attribute value and attribute name will be outputted as an attribute value index. 
Subsequently Only when the keyword filter section inputs the output of the keyword extraction section, and investigates 
sequentially from a head and the attribute name index and attribute value index of the same attribute exist in next doors, 
said attribute name index is deleted. Subsequently It investigates whether the tag corresponding to the pair of the 
attribute name of all the attribute value indexes with which the retrieval means was outputted from the keyword filter 
section into the text file for retrieval, and attribute value is built in. When built, the attribute value of a tag with the 
attribute name of the attribute name index outputted from the keyword filter section is searched and outputted from the 
text file for retrieval. 

[0020] Moreover, in order to sort out only the text file which fulfills the retrieval conditions which the user inputted with 
natural language from from among the text files of a large number registered beforehand and to enable it to display the 
part which the user in it still needs on a user From the set of the text file to build [ two or more ] in, the pair of the tag 
which the attribute name expressing the semantics written into the document attached, and the value of the attribute In 
the text file retrieval equipment which chooses the text file with which are satisfied of the retrieval conditions which the 
user specified with natural language, and displays the suiting part The text file term document which stores the identifier 
and existence location of all the text files used as the candidate for retrieval, The attribute name storing dictionary which 
stores a pair of set with the natural language expression expressing an attribute name and its attribute name about the - 
attribute name which exists in the text file used as the candidate for retrieval, The attribute value storing dictionary which 
stores the set of 3 groups with the natural language expression which expresses the attribute name corresponding to 
attribute value and its attribute value, and its attribute value about the attribute value which exists in the text file used as 
the candidate for retrieval, If a user inputs the retrieval demand sentence expressed with natural language, he will 
investigate said input statement sequentially from a head, and will refer to an attribute name storing dictionary. If the 
natural language expression expressing an attribute name is included, will output the attribute name which is the natural 
language expression and pair as an attribute name index, and an attribute value storing dictionary will be referred to. The 
keyword extraction section which performs one by one outputting a pair of set with the natural language expression, the 
attribute value which is 3 groups, and an attribute name as an attribute value index if the natural language expression 
expressing attribute value is included, Only when the output of the keyword extraction section is inputted, it investigates 
sequentially from a head and the attribute name index and attribute value index of the same attribute exist in next doors 
The keyword filter section which deletes said attribute name index and outputs the other part as it is, When the contents 
and the attribute value index of a text file are inputted, in the contents of said text file Investigate whether a tag including 
the attribute name in said attribute value index exists, and when it exists Take out the attribute value which exists by the 
tag and pair, and whether the value is equal to the attribute value in said attribute value index investigation and when 
equal With the contents Banking Inspection Department of a document which outputs success, and does a rejected output 
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when that is not right If the contents of the text file and one or more attribute value indexes are inputted Take out one at a 
time from said attribute value index, and the contents and said every one taken-out attribute value index of said text file 
are passed to the contents Banking Inspection Department of a document. To all attribute value indexes, when the output 
is success With the contents Banking Inspection Department of an integrated document which outputs success, and 
outputs a rejection when that is not right Every one contents of the text file are taken out with reference to a text file term 
document. The contents of said document, and the part of the attribute value index of the outputs of the keyword filter 
section to the contents Banking Inspection Department of an integrated document Delivery, The success text file name 
sorting section to which it performs receiving the output of said contents Banking Inspection Department of an integrated 
document to said all text files that it took out one [ at a time ], and said output outputs only the identifier of the text file 
of success, If the contents of a text file name and said text file name and the attribute name index which is the output of 
the keyword filter section are inputted When one of said attribute name indexes is taken out, and it investigates whether a 
tag including said taken-out attribute name exists and exists in the contents of said given text file with the contents output 
section of a document which said inputted attribute name index is alike, respectively, receives, and performs displaying 
the value and said inputted text file name of a tag of the attribute name on a user, and outputting nothing in not existing 
Input the set of the text file name which is the output of said success text file name sorting section, and a text file name 
storing dictionary is referred to. Said every one inputted element of a set of a text file name was taken out, and it has the 
contents output-control section of a document which repeats performing passing the contents output section of a 
document to all the text file names under said input. 

[0021] Thus, an operation of the text file retrieval equipment of constituted this invention is explained using an example, 
in order to make the understanding easy. First, what was shown in drawing 5 (a) and (b) is used as a WWW text file used 
as the object which a user searches. In the text file of drawing 5 , the pair of the attribute name and the attribute value of 
an attribute of an attribute other than a text text is contained. Moreover, the following sentence is used as an example of a 
user's retrieval sentence. 

Retrieval input-statement: "I want to find the information on the hotel which it is possible for a price to take a pet in 150 
dollar, and is moreover near San Francisco" 

[0022] First, an input statement is changed into a keyword train in the 1st step. Two kinds exist as a class of keyword. 
The 1st is a natural language expression which refers to an attribute name, and it calls it an attribute name index. The 2nd 
is a natural language expression which refers to attribute value, and they call it an attribute value index. 
[0023] 

the generated keyword train: {price (attribute name index of a "price"), $150 dol (attribute value index of a "price"), and a 
pet (attribute name index of "pet possibility") - possible (attribute value index of "pet possibility"), San Francisco 
(attribute value index of a "location"), and a hotel (attribute name index of a "hotel name") 

} — (4) [0024] Next, a redundant part is unified with reference to the order of a list of an attribute name index and an 
attribute value index. When the attribute name index and attribute value index to the same attribute are located in a line 
with next doors, the attribute name index is deleted. The upper example of a keyword train is compressed as follows, 
the compressed keyword train: {$150 dol (attribute value index of a "price") - possible (attribute value index of "pet 
possibility"), San Francisco (attribute value index of a "location"), and a hotel (attribute name index of a "hotel name") 
} — (5) [0025] Next, the extracted keyword train is interpreted. An attribute value index is interpreted as the conditional 
expression of taking the value which an attribute value index holds as a value of the attribute which it refers to. For 
example, $ 1 50 Dollar (attribute value index of a "price") 
**, value = $150" of ""price" attribute 
The interpretation to say is carried out. 

[0026] When two or more attribute value indexes exist, what carried out the AND of those interpretations serves as the 
whole conditional expression. In the upper example, it is as follows. 

the whole conditional expression - {-- value = [ ] of the value = $150" and ""pet possibility" attribute of a "price" 
attribute - possible --" and "value = San Francisco of a "location" attribute" 

} - (6) [0027] An attribute name index becomes the interpretation "output the value of the attribute which it refers to." In 
the upper example, it is as follows. 

Specific {hotel of a retrieval part" (attribute name index of a "hotel name") 

} - (7) [0028] This semantics serves as the interpretation "output the value of a "hotel name" attribute." When there are 
two or more attribute name indexes, it becomes the semantics "carry out the sequential output of the attribute name index 
of these plurality." 

[0029] The interpretation of the whole input statement is WWW which satisfies the retrieval conditional expression 
generated from an attribute value index. What is necessary is to extract the value of the attribute name which chooses the 
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upper text file, next is specified by the interpretation of an attribute name index out of those text files, and just to display 

it on a user. 

[0030] 

[Embodiment of the Invention] When drawing 1 is referred to, the text file retrieval equipment 100 of the gestalt of 
operation of this invention The text file term document 1, the attribute name storing dictionary 2, and the attribute value 
storing dictionary 3, With the keyword extraction section 4, the keyword filter section 5, and the contents Banking 
Inspection Department 6 of a document It consists of the contents Banking Inspection Department 7 of an integrated 
document, the success text file name sorting section 8, the contents output section 9 of a document, and the contents 
output-control section 10 of a document, and a display 102 and the Internet 103, such as the input devices 101, such as a 
keyboard, and a CRT display, are accessed. 

[003 1] The identifier and the physical location of all the text files used as the candidate for retrieval are stored in the text 
file term document 1. When the text file used as the candidate for retrieval is described by HTML and XML, a text file is 
WWW in the world. Distributing to the server is also possible. In that case, the location of a text file is the URL 
M http:// Mt is described. 

[0032] The pair with the natural language expression expressing the attribute name and its attribute name of the attribute 
tag which exists in the text file used as the candidate for retrieval is registered into the attribute name storing dictionary 
2. The most fundamental thing in the natural language expression which refers to a certain attribute name is the attribute 
name itself. For example, as a natural language expression which refers to the attribute name of a "hotel", it is a "hotel." 
However, there is an expression which refers to a "hotel" besides it. For example, there are a "stay location" and "an 
expression which stays." It becomes a pair as these are in the following table 1 and show, and is registered. 
[0033] 

[ Table 1] 



[xj ID=000003 



[0034] 3 groups with the natural language expression which expresses the attribute name corresponding to attribute value 
and its attribute value and its attribute value about the attribute value which exists in the text file used as the candidate 
for retrieval are stored in the attribute value storing dictionary 3. It is [ which refers to a certain attribute value ] the 
attribute value itself which is the most fundamental as a natural language expression. There is the "X hotel" itself and 
there is no other way but to be [ a natural-language expression which refers to the attribute value of "X hotel" ] 
fastidious. However, in another example, an expression like animal companion being [ O.K. ] "the lump", pet being 
[ O.K. ] "the company", and "dog and cat being good" may also be registered besides "being possible" as a natural 
language expression showing the attribute value of a "pet" attribute. As shown in the following table 2, data are stored in 
the attribute value storing dictionary 3 in 3 groups. 
[0035] 
[Table 2] 
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[0036] The keyword extraction section 4 is an input unit 101 about the input condition retrieval sentence by natural 
language expression. If it leads and receives from a user, it will investigate whether with reference to the attribute name 
storing dictionary 2 and the attribute value storing dictionary 3, there is any expression registered as a natural language 
expression in it into an input condition retrieval sentence. Only an attribute name is outputted, when it is and it is an 
attribute name. This output is called an attribute name index. On the other hand, when it is attribute value, the pair of 
attribute value and a corresponding attribute name is outputted. This output is called an attribute value index. If these are 
investigated from the head of an input condition retrieval sentence and what matches is found in them, they are outputted 
to the sequence. 

[0037] When the output of the keyword extraction section 4 is investigated sequentially from reception and a head as it is 
and the attribute name index and attribute value index of the same attribute exist in next doors, the keyword filter section 
5 deletes the attribute name index, and outputs the other part transparently as it is. 

[0038] The contents Banking Inspection Department 6 of a document receives as an input the character string and 
attribute value index which are the contents of the text file from the contents Banking Inspection Department 7 of an 
integrated document. It investigates whether the tag which includes the attribute name in the received attribute value 
index in the contents character string of the text file received as an input exists, and when it exists, the attribute value 
which exists by that tag and pair is taken out, investigation etc. is carried out [ whether that value is equal to the attribute 
value in this attribute value index, and ], and it is, and success is outputted, and a case carries out a rejected output, when 
that is not right. The contents Banking Inspection Department 6 of a document has played a kind of subroutine-role 
which is called from the contents Banking Inspection Department 7 of an integrated document, and operates. 
[0039] The contents Banking Inspection Department 7 of an integrated document receives as an input the character string 
and one or more attribute value indexes which are the contents of the text file from the success text file name sorting 
section 8. The given attribute value index considers that each is expressing the conditional expression M the value 
described in the attribute value index must be taken as a value of the attribute described in the attribute value index." The 
role of the contents Banking Inspection Department 7 of an integrated document is investigating whether the condition 
being satisfied by finding the attribute expression described by the attribute value index out of the given character string. 
If all the conditions of one or more attribute value indexes given as an input are satisfied, the value of "success" is 
outputted, and when that is not right, the value of a "rejection" will be outputted. The contents Banking Inspection 
Department 6 of a document performs judging whether the character string which is the contents of the text file actually 
satisfies the conditions of one attribute value index. The contents Banking Inspection Department 7 of an integrated 
document is performing a kind of loop control which passes every one attribute value index to the contents Banking 
Inspection Department 6 of a document one by one, when there are two or more attribute value indexes. It is the 
subroutine-role to which the contents Banking Inspection Department 7 of an integrated document is also called from the 
success text file name sorting section 8. 

[0040] The success text file name sorting section 8 is WWW currently distributed all over the world through the Internet 
103 if needed with reference to the text file term document 1. A server is accessed, every one contents of the text file are 
taken out, and the output of delivery and the contents Banking Inspection Department 7 of an integrated document is 
received for the part of an attribute value index to the contents Banking Inspection Department 7 of an integrated 
document among the contents of this document, and the output of the keyword filter section 5. As an output, the value of 
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"success" or a "rejection" is returned here. This processing is performed to all the files registered into the text file term 
document 1, and a text file name is outputted to the contents output-control section 10 of a document only to the file 
whose output of the contents Banking Inspection Department 7 of an integrated document was "success." 
[0041] The contents output section 9 of a document inputs a text file name, the contents of this file, and one or more 
attribute name indexes that are the outputs of the keyword filter section 5. Take out one of the inputted attribute name 
indexes, and in the contents of the inputted text file Investigate whether a tag including the attribute name in this attribute 
name index exists, and when it exists It is a display 102 about a pair with the text file name inputted as the value of the 
attribute value tag corresponding to the attribute name tag. It leads and processing in which nothing is outputted when it 
does not display and exist in a user is performed to each of all the inputted attribute name indexes. The contents output 
section 9 of a document is carrying out the role called in subroutine by the contents output-control section 10 of a 
document. In addition, instead of outputting the pair of the value of an attribute value tag, and a text file name, you may 
make it display the value of an attribute value tag, and the positional information of a text file, and may make it display 
the value of an attribute value tag, a text file name, and its positional information. 

[0042] The contents output-control section 10 of a document considers the set of the text file name which is the output of 
the success text file name sorting section 8 as an own input as it is, and the text file term document 1 is referred to. The 
need is accepted in the contents of the text file under set of the inputted text file name, and it is the Internet 103. It leads 
and is WWW. Access a server and it takes out one at a time. It repeats performing passing the contents output section 9 
of a document to all the text file names under input with the attribute name index generated in a text file name and the 
keyword filter section 5. That is, when three text file names which passed as an input are received, the contents output 
section 9 of a document will be called 3 times.. In addition, the success text file name sorting section 8 is the Internet 103. 
It is the Internet 103 by the contents output-control section 10 of a document using the contents, when the contents of the 
text file which it led and was incorporated from the WWWW server are saved at the magnetic disk drive etc. The count 
of access can be reduced. 

[0043] Drawing 2 and drawing 3 are text file retrieval equipment 100. It is the flow chart which shows the example of 
processing. Hereafter, actuation of the gestalt of this operation is explained. 

[0044] The keyword extraction section 4 is an input unit 101. If it leads and the retrieval input statement by natural 
language expression is received from a user (step SI), the attribute name storing dictionary 2 and the attribute value 
storing dictionary 3 will be referred to. Investigate sequentially from the head of a retrieval input statement, and when it 
is, whether there is any expression registered as a natural language expression in it in a retrieval input statement When it 
is an attribute name, the attribute name index only containing an attribute name is outputted, and when it is attribute 
value, the attribute value index containing the pair of attribute value and a corresponding attribute name is outputted 
(step S2). 

[0045] Next, the keyword filter section 5 inspects the list of the index outputted from the keyword extraction section 4, 
detects the part where the attribute name index and attribute value index of the same attribute are continuing, and deletes 
the attribute name index of the part (step S3). 

[0046] Next, the success text file name sorting section 8 takes out the contents of the document of the text file name, and 
makes the contents Banking Inspection Department 7 of an integrated document judge delivery and success or failure 
paying attention to one text file name in the text file term document 1 with all the attribute value indexes outputted from 
the keyword filter section 5 (step S4). 

[0047] The contents Banking Inspection Department 7 of an integrated document makes the contents Banking Inspection 
Department 6 of a document judge delivery and success or failure for this attribute value index and the contents of the 
text file paying attention to one of the attribute value indexes passed first, in order to inspect the passed contents of a 
document (step S5). 

[0048] The attribute name tag with which the contents Banking Inspection Department 6 of a document has the attribute 
name included in the passed attribute value index in the contents of the passed text file exists. And the value of the 
attribute value tag which is the attribute name tag which existed and pair or [ that inspect whether it is in agreement with 
the attribute value included in the passed attribute value index, and such an attribute name tag does not exist success in 
being in agreement ] — or even if it exists, when the attribute value is not in agreement, the contents Banking Inspection 
Department 7 of an integrated document is notified of a rejection (step S6) . 

[0049] When, as for the contents Banking Inspection Department 7 of an integrated document, success is notified from 
the contents Banking Inspection Department 6 of a document (it is YES at step S7), It investigates whether it finish 
inspecting about all the attribute value indexes notified from the success text file name sorting section 8. When having 
finished inspecting still, attention is moved to one of NO) and the remaining attribute value indexes at the (step S8, and 
the contents Banking Inspection Department 6 of a document is made to judge delivery and success or failure for the 
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attribute value index and contents of the text file (step S9). And when success is declared about all attribute value 
indexes in the contents Banking Inspection Department 7 of a document (it is YES at step S8), success is notified to the 
success text file name sorting section 8, the success text file name sorting section 8 uses the text file concerned as a 
success text file (step S10), and it progresses to step SI 1. On the other hand, when a rejection is notified from the 
contents Banking Inspection Department 6 of a document (it is NO at step S7), the contents Banking Inspection 
Department 7 of an integrated document notifies a rejection to the success text file name sorting section 8, and the 
success text file name sorting section 8 progresses to step SI 1 . 

[0050] After the yes-no decision about one text file finishes, when the unsettled text file remains into the text file term 
document 1 (it is YES at step SI 1), the success text file name sorting section 8 moves attention to one of text file names 
[ them ] (step SI 2), and makes the judgment of success or failure like a previous text file. 

[005 1] If it judges whether the success text file name sorting section 8 had at least one success file (step SI 3) and there is 
no one after finishing the yes-no decision to all the text files in the text file term document 1 (it is YES at step SI 1), the 
text file corresponding to the retrieval conditions inputted, for example will process displaying the purport which one did 
not have on a user etc., and will end processing. On the other hand, when at least one success file exists, the text file 
name of all that success file and all the attribute name indexes outputted from the keyword filter section 5 are notified to 
the contents output-control section 10 of a document, and the contents output control of a document is made to start (step 
SI 4). 

[0052] The contents output-control section 10 of a document takes out the contents of a document paying attention to 
one notified success file name, and the contents output section 9 of a document is made to start processing of delivery 
and the document concerned with all the notified attribute name indexes (step SI 5). 

[0053] It investigates whether the contents output section 9 of a document has an attribute name tag with the attribute 
name of the attribute name index in a document paying attention to one notified attribute name index (step SI 6) (step 
SI 7), and if it is (it is YES at step SI 8), the value and the text file name concerned of the attribute value tag 
corresponding to the attribute name tag will be displayed on a display 102 (step SI 9). If there is nothing (it is NO at step 
SI 8), step SI 9 will be skipped. Next, if the contents output section 9 of a document investigated whether the unsettled 
attribute name index would remain on the notified attribute name index (step S20) and it remains in it, it will move' 
attention to one of them (step S21), and will repeat the processing returned and mentioned above to step SI 7. 
[0054] After finishing the processing about all attribute name indexes in which the contents output section 9 of a 
document was notified (it is NO at step S20), the contents output-control section 10 of a document When investigating 
whether the unsettled thing remains to the text file notified from the success text file name sorting section 8 (step S22) 
and remaining in it, move attention to one of them, and the contents of a document of the text file name are taken out. 
The contents output section 9 of a document is passed, and is made to process with all the attribute name indexes notified 
from the success text file name sorting section 8 (step S23). It becomes processing termination after the processing about 
all success files finishes (it is YES at step S22). 
[0055] 

[Example] Suppose that three text file names, "a file 1", "a file 2", and "a file 3", and URL of those are registered into the 
text file term document 1 so that it may illustrate to drawing 1 . Moreover, the contents of the file 1 show drawing 5 (a), 
and the contents of the file 2 presuppose that it is what is shown in drawing 5 (b). These files 1 and 2 are XML. It is 
described and the attribute and attribute value other than a text text are included. That is, the pair of the tag which the 
attribute name expressing the semantics written into the document of a <hotel> X hotel «location> San Francisco 
</location> <price> $150 </price> <pet> possible < / pet> attached, and the value of the attribute is contained in the file 
1 . Similarly, a tag called a <hotel> Z hotel «location> Seattle </location> <price> $180 </price> <pet> improper < / 
pet> is contained also in the file 2. 

[0056] Moreover, the pair of an attribute name which is illustrated to drawing 1 , and its natural language expression 
shall be stored in the attribute name storing dictionary 2 in advance, and 3 groups of attribute value and a natural 
language expression which are illustrated to drawing 1 , and an attribute name shall be stored in the attribute value 
storing dictionary 3 in advance. In addition, if all prices are registered into the attribute value storing dictionary 3 with 
the actual value, since a number of registration will increase, you may make it register using a variable. That is, XXX If 
it registers as it is shown in the following table 3, when considering as the numeric value of arbitration, and the keyword 
extraction section 4 has a "dollar" behind the numeric value of arbitration, it will be the natural language expression 
XXX. Let what judged that a dollar existed and attached $ to the head of the actual value which existed be attribute 
value. * 
[Table 3] 
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[0057] Actuation of this example is explained to an example for the case where a user inputs the retrieval input statement 
by the following natural language by such premise. 

Retrieval input-statement: "I want to find the information on the hotel which it is possible for a price to take a pet in 150 
dollar, and is moreover near San Francisco" 

[0058] The keyword extraction section 4 will change a retrieval input statement into a keyword train as follows with 
reference to the attribute name storing dictionary 2 and the attribute value storing dictionary 3, if the retrieval input 
statement from a user is received. 

[0059] First, since the natural language expression "a price" of the head of a retrieval input statement exists in the 
attribute name storing dictionary 2, the attribute name "a price" registered into it and a pair by becoming is outputted as 
an attribute name index. Next, since a natural language expression "150 Dollar" exists in the attribute value storing 
dictionary 3, the pair of the attribute value "$150" and the attribute name "the price" which are registered in it and 3 
groups is outputted as an attribute value index. Next, since a natural language expression "a pet" exists in the attribute 
name storing dictionary 2, the attribute name "a pet" registered into it and a pair by becoming is outputted as an attribute 
name index. Next, since a natural language expression "possible" exists in the attribute value storing dictionary 3, the 
pair of the attribute value "possible" and the attribute name "the pet" which are registered in it and 3 groups is outputted 
as an attribute value index. Next, since a natural language expression "San Francisco" exists in the attribute value storing 
dictionary 3, the pair of the attribute value "San Francisco" and the attribute name "the location" which are registered in 
it and 3 groups is outputted as an attribute value index. Next, since a "hotel" exists in the attribute name storing 
dictionary 2, the attribute name "a hotel" registered into it and a pair by becoming is outputted as an attribute name 
index. Into a retrieval input statement, other natural language expressions which match the natural language expression 
registered into the attribute name storing dictionary 2 and the attribute value storing dictionary 3 cannot be found. 
Therefore, the following keyword trains are outputted sequentially from a top. 
[0060] Attribute name index (attribute name "a price") 
Attribute value index (attribute value "$150", attribute name "a price") 
Attribute name index (attribute name "a pet") 

Attribute value index (attribute value "possible", attribute name "a pet") 

Attribute value index (attribute value " "San Francisco", attribute name "a location") 

Attribute name index (attribute name "a hotel") 

[0061] Next, the keyword filter section 5 unifies a redundant part with reference to the order of a list of an attribute name 
index and an attribute value index. Since it has ranked with next doors by the attribute name "a price" with same attribute 
name index (attribute name "price") and attribute value index (attribute value "$150", attribute name "price") in the case 
of the upper keyword train, an attribute name index (attribute name "a price") is deleted. Moreover, since it has ranked 
with next doors by the attribute name "a pet" with same attribute name index (attribute name "pet") and attribute value 
index (attribute value "possible", attribute name "pet"), an attribute name index (attribute name "a pet") is deleted. Since 
the attribute name index which should otherwise be deleted does not exist, finally the above-mentioned keyword train is 
compressed as follows. 

[0062] (a) Attribute value index (attribute value "$150", attribute name "a price") 

(b) Attribute value index (attribute value "possible", attribute name "a pet") 

(c) Attribute value index (attribute value " "San Francisco", attribute name "a location") 

(d) Attribute name index (attribute name "a hotel") 

[0063] next, the success text file name sorting section 8 - the contents of a document of the file 1 in the text file term 
document 1 — the URL — reliance - the Internet 103 from the server which leads and corresponds ~ acquiring -- the 
contents of a document, and the above-mentioned attribute value index (a) - (c) The contents Banking Inspection 
Department 7 of an integrated document is passed. 

[0064] The contents Banking Inspection Department 7 of an integrated document is the contents of a document of a file 

1, and one attribute value index (a). The contents Banking Inspection Department 6 of a document is passed. 

[0065] The contents Banking Inspection Department 6 of a document is an attribute value index (a) in the contents of a 
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document of a file 1 . It investigates whether the tag of an inner attribute name "a price" exists, drawing 5 — (~ a — ) — a 
file — one - a case -- corresponding — a tag — < — a price -->--$— 1 50 — < — /— a price — > — it is ~ since - the — 
attribute value — "-- $ -- 150 - " - having received - attribute value - an index — (— a --) It investigates whether it is in 
agreement with inner attribute value "$150." In the present example, since it is in agreement, success is returned to the 
contents Banking Inspection Department 7 of an integrated document. 

[0066] The contents Banking Inspection Department 7 of an integrated document is the contents of a document of a file 
1, and the following attribute value index (b). The contents Banking Inspection Department 6 of a document is passed. 
[0067] The contents Banking Inspection Department 6 of a document is an attribute value index (b) in the contents of a 
document of a file 1. It investigates whether the tag of an inner attribute name "a pet" exists, drawing 5 » (— a --) - a file 
— one - a case - corresponding - a tag. — < a pet - > - possible - < - /— a pet -- > — it is — since -- the attribute 
value — "— possible — " — having received — attribute value — an index — (— b — ) It investigates whether it is in 
agreement with inner attribute value "possible." In the present example, since it is in agreement, success is returned to 
the contents Banking Inspection Department 7 of an integrated document. 

[0068] The contents Banking Inspection Department 7 of an integrated document is the contents of a document of a file 
1, and the following attribute value index (c). The contents Banking Inspection Department 6 of a document is passed. 
[0069] The contents Banking Inspection Department 6 of a document is an attribute value index (c) in the contents of a 
document of a file 1. It investigates whether the tag of an inner attribute name "a location" exists, drawing 5 — (— a — ) — 
a file — one - a case -- corresponding — a tag — < — a location - > - San Francisco -- < — /— a location - > -- it is — 
since - the - attribute value — "-- San Francisco " - having received — attribute value — an index - (-- c — ) It 
investigates whether it is in agreement with inner attribute value "San Francisco." In the present example, since it is in 
agreement, success is returned to the contents Banking Inspection Department 7 of an integrated document. 
[0070] Since the result of success was obtained by all attribute value indexes about the file 1, the contents Banking 
Inspection Department 7 of an integrated document notifies success to the success text file name sorting section 8, and 
the success text file name sorting section 8 considers a file 1 as a success file. 

[007 1 ] Next, the success text file name sorting section 8 is the Internet 1 03 to reliance about the URL in the contents of a 
document of the file 2 stored in the text file term document 1 . It leads, and incorporates from the corresponding server 
and success or failure is judged using the contents Banking Inspection Department 7 of an integrated document like the 
previous file 1. In this case, since a pet attribute and a location attribute are not satisfied, a file 2 serves as a rejection. 
Similarly, the judgment of success or failure is performed also about the remaining file 3. Here, a file 3 is also judged to 
be a rejection and a success file presupposes that it was it only a file 1. 

[0072] Next, the success text file name sorting section 8 is an attribute name index (d) about a file name 1 as a success 
file name. The contents output-control section 10 of a document is passed. 

[0073] The contents output-control section 10 of a document acquires URL of a file name 1 from the text file term 
document 1, and is the Internet 103 to reliance about it. The upper server is accessed, the contents of a document of a file 
name 1 are acquired, and it is an attribute name index (d). The contents output section 9 of a document is passed. 
[0074] The contents output section 9 of a document is an attribute name index (d) in the contents of a document of a file 
1. It investigates whether an attribute tag with an attribute name "a hotel" exists, since the corresponding tag <hotel> X 
hotel <a /hotel> exists in the case of the file 1 of drawing 5 (a) ~ the attribute value "X hotel" taking out - a file name 
1 — display 102 It displays. 

[0075] Drawing 4 is the block diagram showing the example of a hardware configuration of the text file retrieval 
equipment of this invention. The text file retrieval equipment of this example A processor 200 (CPU), The auxiliary 
storage units 201, such as a magnetic disk drive, and the interface 202 of those, The memory 203, such as RAM, and the 
interface 204 between the Internet 103, The record medium 205 which can machine read CD-ROM, semiconductor 
memory, etc., It consists of buses 209 which connect between the interface 206, an input unit 101, its interface 207, a 
display 102, its interface 208, and CPU200, memory 203 and an interface 202,204,206-208. 

[0076] The program for text file retrieval is recorded on the record medium 205, and the text file term document 1, the 
attribute name storing dictionary 2, and the attribute value storing dictionary 3 of drawing 1 are loaded on memory 203 
or an auxiliary storage unit 201 by installing this program through an interface 206. Moreover, this program operates 
CPU200 by controlling actuation of CPU200 as the keyword extraction section 4 of drawing 1 , the keyword filter 
section 5, the contents Banking Inspection Department 6 of a document, the contents Banking Inspection Department 7 
of an integrated document, the success text file name sorting section 8, the contents output section 9 of a document, and 
the contents output-control section 10 of a document. 

[0077] The gestalt of the above operation is WWW about this invention. Although applied to retrieval of the upper 
homepage, it is applicable also to the retrieval to a database like the technique indicated to JP,5-67136,A. 
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[0078] 

[Effect of the Invention] According to this invention, the following effectiveness is acquired as explained above. 
[0079] The redundancy of the reply to the retrieval inquiry by natural language can be abolished as much as possible. 
The reason is that it has the keyword filter section which deletes an attribute name index when the index train extracted 
in the keyword extraction section is investigated sequentially from the head and the attribute name index and attribute 
value index of the same attribute exist in next doors. 

[0080] WWW The retrieval inquiry by natural language becomes possible also to a homepage. The reason is XML. It is 
because the attribute value which a user desires can be taken out out of a text file with the attribute name and attribute 
value which interpret the retrieval conditions by the natural language which was made to build in the tag of the attribute 
name expressing the semantics, and attribute value, and the user inputted into the text file like, and suit. WWW by the 
natural language interface which receives the input of various forms, such as a fragment of a right natural language 
expression, a nongrammatical expression, and a natural language sentence, and a keyword train, grammatically, and 
interprets systematically by this A document-retrieval system is realizable. 



[Translation done.] 
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2 **** s hows the word which can not be translated. 
3. In the drawings, any words are not translated. 

TECHNICAL FIELD 

[Field of the Invention] Especially this invention relates to the text file retrieval equipment which made the retrieval 
inquiry by natural language possible about text file retrieval equipment. 



[Translation done.] 
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PRIOR ART 

[Description of the Prior Art] When it is going to make a precision express a retrieval intention of a user more in 
information retrieval generally, the method of making it express with natural language like Japanese or English is 
effective. The system which performs retrieval to a database with natural language already exists, and is called the 
natural language interface (bibliography: DIBEROPPINGUANACHURARURANGEJI interface two complex data and 
G . G . ''Developing a Natural Language Interface to Complex Data" besides Hendrix, ACM Trans.on Database Systems, 
1978.). 

[0003] The conventional natural language interface interprets the retrieval inquiry by a user's natural language, changes 
the inquiry into the retrieval type of the query language (SQL) of a database, and shows a user the retrieval result to 
which the retrieval type was returned by database system from delivery and database system. 

[0004] However, although commercialization already started and the conventional natural language interface left for 20 
years or more, it has not reached the level of practical use yet. Since the functor in which a natural language interface 
system cannot interpret a free inquiry of a user, but the system permits one of the reason of the, and a vocabulary are 
limited clearly, a user is for having to memorize what kind of expression can be used after all. Therefore, a natural 
language interface is not different from a complicated command system. That is, it was a problem to say that the 
conventional natural language interface cannot receive a user's free expression (bibliography: 

DIBEROPPINGUANACHURARURANGEJI interface two complex data and G . G . "Developing a Natural Language 
Interface to Complex Data" besides Hendrix, ACM Trans.on Database Systems, 1978.). It was especially a problem that 
it is not allowed although he wants to use not the natural language with a perfect user about an easy question but a brief 
expression. As an example of a brief expression, there are a part of expression by the keyword train, nongrammatical 
expression or natural language sentence, and **. 

[0005] Then, these people proposed the "natural language interpretation approach" of realizing retrieval by the natural 
language of a brief expression, in previous patent application (JP,5-67136,A). This makes applicable to retrieval the table 
of the database which is the assembly of the group of the attribute name and the attribute value of an attribute of an 
attribute. Each word in the inquiry sentence by natural language is classified into an attribute name, attribute value, and 
others. Save the word classified into the attribute name as a response attribute name, and the word classified into 
attribute value makes a group the attribute value and the attribute name corresponding to it, and it saves it as condition 
attribute value ****. When all the groups of the attribute value in this saved condition attribute value **** and an 
attribute name exist in a table, the attribute value in said table corresponding to each attribute name saved as said 
response attribute name is outputted as a reply to an inquiry. In addition, there is the "natural language interpretation 
approach" which starts JP,5-242147,A for which these people applied previously too as a conventional technique similar 
to this. 

[0006] On the other hand, it follows on expansion of use of World Wide Web (WWW) in recently, and is WWW. The 
importance of the upper retrieval technique is increasing. WWW A user is WWW. The typical tool used when carrying 
out information retrieval in a top is a search engine. As this example, Altavista, Infoseek, Lycos, etc. are famous. 
However, in a search engine, since it is the format of the retrieval which combines a keyword, the intention of retrieval 
of a user is not reflected directly in many cases. For example, a price is 150 when looking for the information about a 
hotel. It is possible to take a pet in a dollar, and it is impossible to express such a retrieval intention only along a keyword 
to find the homepage of the hotel which is moreover near San Francisco. Temporarily, it is (1). It expresses by the 
formula. 150 A dollar, San Francisco, a hotel, and a pet are possible. - If (1) and a keyword are put in order, the list of 
very a lot of homepages will be outputted. 

[0007] A natural language interface system is introduced and it is WWW. If a homepage can be searched, precise 
retrieval conditions like the upper example will be expressed obediently, and only a suitable homepage can be searched. 
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However, WWW The example which applied the natural language interface to retrieval of the upper homepage is not 
found. 

[0008] in addition, WWW although there be "information retrieval equipment" indicated by JP, 10-40262, A as other 
examples of the retrieval technique over a homepage, since this aim at the information retrieval suitable for the 
sensibility of the user who do not have the retrieval conditions for [ clear ] retrieval by enable retrieval which made 
sensibility expression data the keyword, it be fit for retrieval by precise retrieval conditions like the upper example. 

[Translation done.] 
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EFFECT OF THE INVENTION 

[Effect of the Invention] According to this invention, the following effectiveness is acquired as explained above. 
[0079] The redundancy of the reply to the retrieval inquiry by natural language can be abolished as much as possible. 
The reason is that it has the keyword filter section which deletes an attribute name index when the index train extracted 
in the keyword extraction section is investigated sequentially from the head and the attribute name index and attribute 
value index of the same attribute exist in next doors. 

[0080] WWW The retrieval inquiry by natural language becomes possible also to a homepage. The reason is XML. It is 
because the attribute value which a user desires can be taken out out of a text file with the attribute name and attribute 
value which interpret the retrieval conditions by the natural language which was made to build in the tag of the attribute 
name expressing the semantics, and attribute value, and the user inputted into the text file like, and suit. WWW by the 
natural language interface which receives the input of various forms, such as a fragment of a right natural language 
expression, a nongrammatical expression, and a natural language sentence, and a keyword train, grammatically, and 
interprets systematically by this A document-retrieval system is realizable. 

[Translation done,] 
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TECHNICAL PROBLEM 

[Problem(s) to be Solved by the Invention] Although the technique indicated by JP,5-67136,A mentioned above can 
serve as an effective means when putting a natural language interface system in practical use since it can interpret the 
inquiry by natural language by the simple approach, the technical problem which should still be solved is left behind. 
Since it is surely treated as a response attribute name and is included during the reply to an inquiry when a certain 
attribute name exists in the inquiry sentence by natural language, it is that a reply may become redundancy. 
[0010] For example, a table with "an attribute name = title, its attribute value = No Longer Human, an attribute name = 
author, and its attribute value = Dazaiosamu" is received. When the inquiry "the author of title = No Longer Human is ?" 
is performed, first Subsequently, "No Longer Human" is judged to be attribute value, a group with the "title" which is 
attribute value "No Longer Human" and its attribute name is saved as condition attribute value ****, a "title" is judged to 
be an attribute name, and it is saved as a response attribute name, and he is saved [ an "author" is judged to be an 
attribute name and ] as a response attribute name. And said table which has the attribute value in condition attribute 
value **** "No Longer Human" and a group with an attribute name "a title" is searched, and the attribute value "No 
Longer Human" and "Dazaiosamu" corresponding to a response attribute name "a title" and an "author" are searched and 
outputted from the table. That is, the part and reply which are outputting "No Longer Human" are redundancy. 
[001 1] Then, the purpose of this invention is to abolish the redundancy of the reply to the retrieval inquiry by natural 
language as much as possible. 

[0012] Moreover, other purposes of this invention are WWW. It is in making the retrieval inquiry by natural language 
possible also to a homepage. 

[Translation done.] 
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MEANS 

[Means for Solving the Problem] (1) In the technique indicated by JP,5-67136,A in which the 1st carried out invention 
****, although the user asked the attribute value corresponding to it and the reason the reply is redundancy has described 
it in a sentence when an attribute name exists in the inquiry sentence by natural language, it is because it is treating as a 
response attribute name uniformly. Then, when the attribute name and attribute value of the same attribute appear in next 
doors, he is trying not to include the attribute name in a response attribute name in this invention paying attention to the 
point in the inclination which a user adjoins them mutually and is described when describing a certain attribute name and 
the attribute value corresponding to it, in order to specify retrieval conditions in the inquiry sentence by natural language. 
The text file which contains the pair of the attribute name and the attribute value of an attribute of an attribute is more 
specifically used as the text file for retrieval. In the text file retrieval equipment with which the part which suits the 
retrieval conditions which the user specified with natural language from the text file for retrieval is searched Investigate 
the retrieval demand sentence expressed with natural language sequentially from a head, and the attribute name is 
outputted as an attribute name index to the natural language expression expressing an attribute name. The keyword 
extraction section which performs outputting the pair of the attribute value and attribute name as an attribute value index 
to the natural language expression expressing attribute value one by one, Only when the output of said keyword 
extraction section is inputted, it investigates sequentially from a head and the attribute name index and attribute value 
index of the same attribute exist in next doors The keyword filter section which deletes said attribute name index and 
outputs the other part as it is, It investigates whether the pair of the attribute name of all attribute value indexes and 
attribute value which were outputted from said keyword filter section into the text file for retrieval is built in. When built, 
it has a retrieval means to search and output the attribute value corresponding to the attribute name of the attribute name 
index outputted from said keyword filter section from the text file for retrieval. 

[0014] Furthermore, in order to enable it to judge correctly which natural language expression which natural language 
expression in a retrieval demand sentence expresses an attribute name, and is expressing attribute value The attribute 
name storing dictionary which stores the pair with the natural language expression expressing an attribute name and its 
attribute name about the attribute name which exists in the text file for retrieval, It has the attribute value storing 
dictionary which stores 3 groups with the natural language expression which expresses the attribute name corresponding 
to attribute value and its attribute value, and its attribute value about the attribute value which exists in the text file for 
retrieval. Said keyword extraction section investigates the retrieval demand sentence expressed with natural language 
sequentially from a head, and refers to an attribute name storing dictionary. If the natural language expression expressing 
an attribute name is included, will output the attribute name which is the natural language expression and pair as an 
attribute name index, and an attribute value storing dictionary will be referred to. If the natural language expression 
expressing attribute value is included, it has the configuration which outputs a pair of set with the natural language 
expression, the attribute value which is 3 groups, and an attribute name as an attribute value index. 
[0015] Thus, if it is in the text file retrieval equipment of constituted this invention When a user inputs the retrieval 
demand sentence expressed with natural language, first the keyword extraction section Investigate a retrieval demand 
sentence sequentially from a head, and the attribute name is outputted as an attribute name index to the natural language 
expression expressing an attribute name. To the natural language expression expressing attribute value, the pair of the 
attribute value and attribute name is outputted as an attribute value index. Oniy when the keyword filter section inputs 
the output of the keyword extraction section, and investigates sequentially from a head and the attribute name index and 
attribute value index of the same attribute exist in next doors, an attribute name index is deleted. Subsequently, 
subsequently It investigates whether the pair of the attribute name of all the attribute value indexes with which the 
retrieval means was outputted from the keyword filter section into the text file for retrieval, and attribute value is built in. 
When built, it has prevented the reply to a user becoming redundancy by searching and outputting the attribute value 
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corresponding to the attribute name of the attribute name index outputted from the keyword filter section from the text 
file for retrieval. 

[0016] (2) 2nd invention WWW For applying a natural language interface to retrieval of the upper homepage, the 
difficult reason is WWW. The contents of the homepage are the files which consist of the texts and drawings which were 
drawn with natural language, and it is not being the set of an attribute name and attribute value like a database. That is, 
the target database [ interface system / which is the former / natural language ] is supposing there is a database of the 
hotel which comes out in the example of the term of a Prior art since it was the set of an attribute name and attribute 
value (identifier: X hotel, price:$150, petpossibility of, location:San Francisco), 
(Identifier: Y hotel, price:$200, pefcimproper, locatiomLos Angeles) 
(Identifier: Z hotel, price:$180, petimproper, location: Seattle) 

** - it stores with a gestalt [ like ] having — ****-- an inquiry of a user — (2) it expresses by the formula - as - "-- 
price =$150 and pet = — possible and location = San Francisco" 

— (2) It was [ like ] convertible. This is SQL. It is changed into language as it is. However, WWW Since the information 
on such [ usually ] an attribute name and attribute value is not contained, a homepage is SQL. It is not convertible for the 
retrieval type which can be expressed by the formula. That is, the conventional WWW Homepage creation language is 
HTML (bibliography: the homepage of a World-Wide-Web consortium, URL http://www.w3.org). In HTML, the group 
of the attribute name for expressing the configuration and attribute value is built in in the text file. For example, it is the 
example of an HTML file which is shown in drawing 6 . here — < — what it is the attribute tag which was surrounded by 
> and used independently (example : <B>) Initiation tag (example : <TR>) Termination tag (example : </TR>) There are 
some which are used by the pair. The description of a HTML tag is limited to it defining the expression of the 
appearance in a text file. For example, the tag made a table-expression is <TABLE> and the tag showing line feed is 
expressed by <P>. It is WWW about such an HTML file. If it is made to read into a browser, it will become a gestalt as 
shown in drawing 7 , and will be indicated by the output at a user. However, HTML cannot define the tag for expressing 
the semantics in a document. 

[0017] So, at this invention, it is WWW. The group of the attribute name and attribute value expressing the semantics in 
the text file is made to build in a text file. XML (Extensible Markup Language) specifically extended so that the contents 
of the document could be expressed by a pair of set of an attribute tag and its attribute value in a file describes a 
document (bibliography: the homepage of a World- Wide- Web consortium, "EKUSUTENSHIBURU markup language 
1.0 M http://www. w3.org/TR/PR-xml -971208). . XML WWW The specification was proposed by the World- Wide- Web 
consortium (bibliography: the homepage of a World- Wide- Web consortium, URL http://www.w3.org) which is the 
engine which decides a criterion in December, 1997. XML By the described document, a machine becomes legible about 
the contents of the document and a content addressed retrieval becomes possible. Then, that is used in this invention and 
it is WWW. The retrieval inquiry by natural language is realized to a homepage. 

[0018] The text file which contains the pair of the tag which the attribute name which specifically expresses the 
semantics written into the document attached, and the attribute value of the attribute is used as the text file for retrieval. 
In the text file retrieval equipment with which the part which suits the retrieval conditions which the user specified with 
natural language from the text file for retrieval is searched The attribute name storing dictionary which stores a pair of 
set with the natural language expression expressing an attribute name and its attribute name about the attribute name 
which exists in the text file for retrieval, The attribute value storing dictionary which stores the set of 3 groups with the 
natural language expression which expresses the attribute name corresponding to attribute value and its attribute value, 
and its attribute value about the attribute value which exists in the text file for retrieval, Investigate the retrieval demand 
sentence expressed with natural language sequentially from a head, and an attribute name storing dictionary is referred 
to. If the natural language expression expressing an attribute name is included, will output the attribute name which is the 
natural language expression and pair as an attribute name index, and an attribute value storing dictionary will be referred 
to. The keyword extraction section which will output a pair of set with the natural language expression, the attribute 
value which is 3 groups, and an attribute name as an attribute value index if the natural language expression expressing 
attribute value is included, Only when the output of the keyword extraction section is inputted, it investigates 
seauentiallv from a head and the attribute name index and attribute value index of the same attribute exist in next doors 
The keyword filter section which deletes said attribute name index and outputs the other part as it is, It investigates 
whether the tag corresponding to the pair of the attribute name of all attribute value indexes and attribute value which 
were outputted from said keyword filter section into the text file for retrieval is built in. When built, it has a retrieval 
means to search and output the attribute value of a tag with the attribute name of the attribute name index outputted from 
said keyword filter section from the text file for retrieval. 

[0019] Thus, if it is in the text file retrieval equipment of constituted this invention When a user inputs the retrieval 
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demand sentence specified with natural language, the keyword extraction section If the natural language expression 
which investigates a retrieval demand sentence sequentially from a head, and expresses an attribute name is included The 
attribute name is outputted as an attribute name index, and if the natural language expression expressing attribute value is 
included, a pair of set with the attribute value and attribute name will be outputted as an attribute value index. 
Subsequently Only when the keyword filter section inputs the output of the keyword extraction section, and investigates 
sequentially from a head and the attribute name index and attribute value index of the same attribute exist in next doors, 
said attribute name index is deleted. Subsequently It investigates whether the tag corresponding to the pair of the 
attribute name of all the attribute value indexes with which the retrieval means was outputted from the keyword filter 
section into the text file for retrieval, and attribute value is built in. When built, the attribute value of a tag with the 
attribute name of the attribute name index outputted from the keyword filter section is searched and outputted from the 
text file for retrieval. 

[0020] Moreover, in order to sort out only the text file which fulfills the retrieval conditions which the user inputted with 
natural language from from among the text files of a large number registered beforehand and to enable it to display the 
part which the user in it still needs on a user From the set of the text file to build [ two or more ] in, the pair of the tag 
which the attribute name expressing the semantics written into the document attached, and the value of the attribute In 
the text file retrieval equipment which chooses the text file with which are satisfied of the retrieval conditions which the 
user specified with natural language, and displays the suiting part The text file term document which stores the identifier 
and existence location of all the text files used as the candidate for retrieval, The attribute name storing dictionary which 
stores a pair of set with the natural language expression expressing an attribute name and its attribute name about the 
attribute name which exists in the text file used as the candidate for retrieval, The attribute value storing dictionary which 
stores the set of 3 groups with the natural language expression which expresses the attribute name corresponding to 
attribute value and its attribute value, and its attribute value about the attribute value which exists in the text file used as 
the candidate for retrieval, If a user inputs the retrieval demand sentence expressed with natural language, he will 
investigate said input statement sequentially from a head, and will refer to an attribute name storing dictionary. If the 
natural language expression expressing an attribute name is included, will output the attribute name which is the natural 
language expression and pair as an attribute name index, and an attribute value storing dictionary will be referred to. The 
keyword extraction section which performs one by one outputting a pair of set with the natural language expression, the 
attribute value which is 3 groups, and an attribute name as an attribute value index if the natural language expression 
expressing attribute value is included, Only when the output of the keyword extraction section is inputted, it investigates 
sequentially from a head and the attribute name index and attribute value index of the same attribute exist in next doors 
The keyword filter section which deletes said attribute name index and outputs the other part as it is, When the contents 
and the attribute value index of a text file are inputted, in the contents of said text file Investigate whether a tag including 
the attribute name in said attribute value index exists, and when it exists Take out the attribute value which exists by the 
tag and pair, and whether the value is equal to the attribute value in said attribute value index investigation and when 
equal With the contents Banking Inspection Department of a document which outputs success, and does a rejected output 
when that is not right If the contents of the text file and one or more attribute value indexes are inputted Take out one at a 
time from said attribute value index, and the contents and said every one taken-out attribute value index of said text file 
are passed to the contents Banking Inspection Department of a document. To all attribute value indexes, when the output 
is success With the contents Banking Inspection Department of an integrated document which outputs success, and 
outputs a rejection when that is not right Every one contents of the text file are taken out with reference to a text file term 
document. The contents of said document, and the part of the attribute value index of the outputs of the keyword filter 
section to the contents Banking Inspection Department of an integrated document Delivery, The success text file name 
sorting section to which it performs receiving the output of said contents Banking Inspection Department of an integrated 
document to said all text files that it took out one [ at a time ], and said output outputs only the identifier of the text file 
of success, If the contents of a text file name and said text file name and the attribute name index which is the output of 
the keyword filter section are inputted When one of said attribute name indexes is taken out, and it investigates whether a 
tag including said taken-out attribute name exists and exists in the contents of said given text file with the contents output 
section of a document which said inputted attribute name index is alike, respectively, receives, and performs displaying 
the value and said inputted text file name of a tag of the attribute name on a user, and outputting nothing in not existing 
Input the set of the text file name which is the output of said success text file name sorting section, and a text file name 
storing dictionary is referred to. Said every one inputted element of a set of a text file name was taken out, and it has the 
contents output-control section of a document which repeats performing passing the contents output section of a 
document to all the text file names under said input. 

[0021] Thus, an operation of the text file retrieval equipment of constituted this invention is explained using an example, 
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in order to make the understanding easy. First, what was shown in drawing 5 (a) and (b) is used as a WWW text file used 
as the object which a user searches. In the text file of drawing 5 , the pair of the attribute name and the attribute value of 
an attribute of an attribute other than a text text is contained. Moreover^ the following sentence is used as an example of a 
user's retrieval sentence. 

Retrieval input-statement: "I want to find the information on the hotel which it is possible for a price to take a pet in 150 
dollar, and is moreover near San Francisco" 

[0022] First, an input statement is changed into a keyword train in the 1st step. Two kinds exist as a class of keyword. 
The 1st is a natural language expression which refers to an attribute name, and it calls it an attribute name index. The 2nd 
is a natural language expression which refers to attribute value, and they call it an attribute value index. 
[0023] 

the generated keyword train: {price (attribute name index of a "price"), $150 dol (attribute value index of a "price"), and a 
pet (attribute name index of "pet possibility") — possible (attribute value index of "pet possibility"), San Francisco 
(attribute value index of a "location"), and a hotel (attribute name index of a "hotel name") 

} — (4) [0024] Next, a redundant part is unified with reference to the order of a list of an attribute name index and an 
attribute value index. When the attribute name index and attribute value index to the same attribute are located in a line 
with next doors, the attribute name index is deleted. The upper example of a keyword train is compressed as follows, 
the compressed keyword train: {$150 dol (attribute value index of a "price") — possible (attribute value index of "pet 
possibility"), San Francisco (attribute value index of a "location"), and a hotel (attribute name index of a "hotel name") 
} — (5) [0025] Next, the extracted keyword train is interpreted. An attribute value index is interpreted as the conditional 
expression of taking the value which an attribute value index holds as a value of the attribute which it refers to. For 
example, $150 Dollar (attribute value index of a "price") 
**, value = $150" of ""price" attribute 
The interpretation to say is carried out. 

[0026] When two or more attribute value indexes exist, what carried out the AND of those interpretations serves as the 
whole conditional expression. In the upper example, it is as follows. 

the whole conditional expression - {-- "-- value = [ ] of the value = $150" and ""pet possibility" attribute of a "price" 
attribute - possible --" and "value = San Francisco of a "location" attribute" 

} — (6) [0027] An attribute name index becomes the interpretation "output the value of the attribute which it refers to." In 
the upper example, it is as follows. 

Specific {hotel of a retrieval part" (attribute name index of a "hotel name") 

} — (7) [0028] This semantics serves as the interpretation "output the value of a "hotel name" attribute." When there are 
two or more attribute name indexes, it becomes the semantics "carry out the sequential output of the attribute name index 
of these plurality." 

[0029] The interpretation of the whole input statement is WWW which satisfies the retrieval conditional expression 
generated from an attribute value index. What is necessary is to extract the value of the attribute name which chooses the 
upper text file, next is specified by the interpretation of an attribute name index out of those text files, and just to display 
it on a user. 
[0030] 

[Embodiment of the Invention] When drawing 1 is referred to, the text file retrieval equipment 100 of the gestalt of 
operation of this invention The text file term document 1, the attribute name storing dictionary 2, and the attribute value 
storing dictionary 3, With the keyword extraction section 4, the keyword filter section 5, and the contents Banking 
Inspection Department 6 of a document It consists of the contents Banking Inspection Department 7 of an integrated 
document, the success text file name sorting section 8, the contents output section 9 of a document, and the contents 
output-control section 10 of a document, and a display 102 and the Internet 103, such as the input devices 101, such as a 
keyboard, and a CRT display, are accessed. 

[0031] The identifier and the physical location of all the text files used as the candidate for retrieval are stored in the text 
file term document 1 . When the text file used as the candidate for retrieval is described by HTML and XML, a text file is 
WWW in the world. Distributing to the server is also possible. In that case, the location of a text file is the URL 
"http:// ". It is described. 

[0032] The pair with the natural language expression expressing the attribute name and its attribute name of the attribute 
tag which exists in the text file used as the candidate for retrieval is registered into the attribute name storing dictionary 
2. The most fundamental thing in the natural language expression which refers to a certain attribute name is the attribute 
name itself. For example, as a natural language expression which refers to the attribute name of a "hotel", it is a "hotel." 
However, there is an expression which refers to a "hotel" besides it. For example, there are a "stay location" and "an 
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[0034] 3 groups with the natural language expression which expresses the attribute name corresponding to attribute value 
and its attribute value and its attribute value about the attribute value which exists in the text file used as the candidate 
for retrieval are stored in the attribute value storing dictionary 3. It is [ which refers to a certain attribute value ] the 
attribute value itself which is the most fundamental as a natural language expression. There is the "X hotel" itself and 
there is no other way but to be [ a natural language expression which refers to the attribute value of "X hotel" ] 
fastidious. However, in another example, an expression like animal companion being [ O.K. ] "the lump", pet being 
[ O.K. ] "the company", and "dog and cat being good" may also be registered besides "being possible" as a natural 
language expression showing the attribute value of a "pet" attribute. As shown in the following table 2, data are stored in 
the attribute value storing dictionary 3 in 3 groups. 
[0035] 

[ Table 2] 



Q ID=000004 



[0036] The keyword extraction section 4 is an input unit 101 about the input condition retrieval sentence by natural 
language expression. If it leads and receives from a user, it will investigate whether with reference to the attribute name 
storing dictionary 2 and the attribute value storing dictionary 3, there is any expression registered as a natural language 
expression in it into an input condition retrieval sentence. Only an attribute name is outputted, when it is and it is an 
attribute name. This output is called an attribute name index. On the other hand, when it is attribute value, the pair of 
attribute value and a corresponding attribute name is outputted. This output is called an attribute value index. If these are 
investigated from the head of an input condition retrieval sentence and what matches is found in them, they are outputted 
to the sequence. 

[0037] When the output of the keyword extraction section 4 is investigated sequentially from reception and a head as it is 
and the attribute name index and attribute value index of the same attribute exist in next doors, the keyword filter section 
5 deletes the attribute name index, and outputs the other part transparently as it is. 

[0038] The contents Banking Inspection Department 6 of a document receives as an input the character string and 
attribute value index which are the contents of the text file from the contents Banking Inspection Department 7 of an 
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integrated document. It investigates whether the tag which includes the attribute name in the received attribute value 
index in the contents character string of the text file received as an input exists, and when it exists, the attribute value 
which exists by that tag and pair is taken out, investigation etc. is carried out [ whether that value is equal to the attribute 
value in this attribute value index, and ], and it is, and success is outputted, and a case carries out a rejected output, when 
that is not right. The contents Banking Inspection Department 6 of a document has played a kind of subroutine-role 
which is called from the contents Banking Inspection Department 7 of an integrated document, and operates. 
[0039] The contents Banking Inspection Department 7 of an integrated document receives as an input the character string 
and one or more attribute value indexes which are the contents of the text file from the success text file name sorting 
section 8. The given attribute value index considers that each is expressing the conditional expression "the value 
described in the attribute value index must be taken as a value of the attribute described in the attribute value index." The 
role of the contents Banking Inspection Department 7 of an integrated document is investigating whether the condition 
being satisfied by finding the attribute expression described by the attribute value index out of the given character string. 
If all the conditions of one or more attribute value indexes given as an input are satisfied, the value of "success" is 
outputted, and when that is not right, the value of a "rejection" will be outputted. The contents Banking Inspection 
Department 6 of a document performs judging whether the character string which is the contents of the text file actually 
satisfies the conditions of one attribute value index. The contents Banking Inspection Department 7 of an integrated 
document is performing a kind of loop control which passes every one attribute value index to the contents Banking 
Inspection Department 6 of a document one by one, when there are two or more attribute value indexes. It is the 
subroutine-role to which the contents Banking Inspection Department 7 of an integrated document is also called from the 
success text file name sorting section 8. 

[0040] The success text file name sorting section 8 is WWW currently distributed all over the world through the Internet 
1 03 if needed with reference to the text file term document 1 . A server is accessed, every one contents of the text file are 
taken out, and the output of delivery and the contents Banking Inspection Department 7 of an integrated document is 
received for the part of an attribute value index to the contents Banking Inspection Department 7 of an integrated 
document among the contents of this document, and the output of the keyword filter section 5. As an output, the value of 
"success" or a "rejection" is returned here. This processing is performed to all the files registered into the text file term 
document 1, and a text file name is outputted to the contents output-control section 10 of a document only to the file 
whose output of the contents Banking Inspection Department 7 of an integrated document was "success." 
[0041] The contents output section 9 of a document inputs a text file name, the contents of this file, and one or more 
attribute name indexes that are the outputs of the keyword filter section 5. Take out one of the inputted attribute name 
indexes, and in the contents of the inputted text file Investigate whether a tag including the attribute name in this attribute 
name index exists, and when it exists It is a display 102 about a pair with the text file name inputted as the value of the 
attribute value tag corresponding to the attribute name tag. It leads and processing in which nothing is outputted when it 
does not display and exist in a user is performed to each of all the inputted attribute name indexes. The contents output 
section 9 of a document is carrying out the role called in subroutine by the contents output-control section 10 of a 
document. In addition, instead of outputting the pair of the value of an attribute value tag, and a text file name, you may 
make it display the value of an attribute value tag, and the positional information of a text file, and may make it display 
the value of an attribute value tag, a text file name, and its positional information. 

[0042] The contents output-control section 10 of a document considers the set of the text file name which is the output of 
the success text file name sorting section 8 as an own input as it is, and the text file term document 1 is referred to. The 
need is accepted in the contents of the text file under set of the inputted text file name, and it is the Internet 103. It leads 
and is WWW. Access a server and it takes out one at a time. It repeats performing passing the contents output section 9 
of a document to all the text file names under input with the attribute name index generated in a text file name and the 
keyword filter section 5. That is, when three text file names which passed as an input are received, the contents output 
section 9 of a document will be called 3 times. In addition, the success text file name sorting section 8 is the Internet 103. 
It is the Internet 103 by the contents output-control section 10 of a document using the contents, when the contents of the 
text file which it led and was incorporated from the WWWW server are saved at the magnetic disk drive etc. The count 
of access can be reduced. 

[0043] Drawing 2 and drawing 3 are text file retrieval equipment 100. It is the flow chart which shows the example of 
processing. Hereafter, actuation of the gestalt of this operation is explained. 

[0044] The keyword extraction section 4 is an input unit 101. If it leads and the retrieval input statement by natural 
language expression is received from a user (step SI), the attribute name storing dictionary 2 and the attribute value 
storing dictionary 3 will be referred to. Investigate sequentially from the head of a retrieval input statement, and when it 
is, whether there is any expression registered as a natural language expression in it in a retrieval input statement When it 
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is an attribute name, the attribute name index only containing an attribute name is outputted, and when it is attribute 
value, the attribute value index containing the pair of attribute value and a corresponding attribute name is outputted 
(stepS2). 

[0045] Next, the keyword filter section 5 inspects the list of the index outputted from the keyword extraction section 4, 
detects the part where the attribute name index and attribute value index of the same attribute are continuing, and deletes 
the attribute name index of the part (step S3). 

[0046] Next, the success text file name sorting section 8 takes out the contents of the document of the text file name, and 
makes the contents Banking Inspection Department 7 of an integrated document judge delivery and success or failure 
paying attention to one text file name in the text file term document 1 with all the attribute value indexes outputted from 
the keyword filter section 5 (step S4). 

[0047] The contents Banking Inspection Department 7 of an integrated document makes the contents Banking Inspection 
Department 6 of a document judge delivery and success or failure for this attribute value index and the contents of the 
text file paying attention to one of the attribute value indexes passed first, in order to inspect the passed contents of a 
document (step S5). 

[0048] The attribute name tag with which the contents Banking Inspection Department 6 of a document has the attribute 
name included in the passed attribute value index in the contents of the passed text file exists. And the value of the 
attribute value tag which is the attribute name tag which existed and pair or [ that inspect whether it is in agreement with 
the attribute value included in the passed attribute value index, and such an attribute name tag does not exist success in 
being in agreement ] - or even if it exists, when the attribute value is not in agreement, the contents Banking Inspection 
Department 7 of an integrated document is notified of a rejection (step S6) . 

[0049] When, as for the contents Banking Inspection Department 7 of an integrated document, success is notified from 
the contents Banking Inspection Department 6 of a document (it is YES at step S7), It investigates whether it finish 
inspecting about all the attribute value indexes notified from the success text file name sorting section 8. When having 
finished inspecting still, attention is moved to one of NO) and the remaining attribute value indexes at the (step S8, and 
the contents Banking Inspection Department 6 of a document is made to judge delivery and success or failure for the 
attribute value index and contents of the text file (step S9). And when success is declared about all attribute value 
indexes in the contents Banking Inspection Department 7 of a document (it is YES at step S8), success is notified to the 
success text file name sorting section 8, the success text file name sorting section 8 uses the text file concerned as a 
success text file (step S10), and it progresses to step SI 1 . On the other hand, when a rejection is notified from the 
contents Banking Inspection Department 6 of a document (it is NO at step S7), the contents Banking Inspection 
Department 7 of an integrated document notifies a rejection to the success text file name sorting section 8, and the 
success' text file name sorting section 8 progresses to step SI 1 . 

[0050] After the yes-no decision about one text file finishes, when the unsettled text file remains into the text file term 
document 1 (it is YES at step SI 1), the success text file name sorting section 8 moves attention to one of text file names 
[ them ] (step SI 2), and makes the judgment of success or failure like a previous text file. 

[005 1] If it judges whether the success text file name sorting section 8 had at least one success file (step SI 3) and there is 
no one after finishing the yes-no decision to all the text files in the text file term document 1 (it is YES at step SI 1), the 
text file corresponding to the retrieval conditions inputted, for example will process displaying the purport. which one did 
not have on a user etc., and will end processing. On the other hand, when at least one success file exists, the text file 
name of all that success file and all the attribute name indexes outputted from the keyword filter section 5 are notified to 
the contents output-control section 10 of a document, and the contents output control of a document is made to start (step 
S14). 

[0052] The contents output-control section 10 of a document takes out the contents of a document paying attention to 
one notified success file name, and the contents output section 9 of a document is made to start processing of delivery 
and the document concerned with all the notified attribute name indexes (step SI 5). 

[0053] It investigates whether the contents output section 9 of a document has an attribute name tag with the attribute 
name of the attribute name index in a document paying attention to one notified attribute name index (step SI 6) (step 
SI 7), and if it is (it is YES at step SI 8), the value and the text file name concerned of the attribute value tag 
corresponding to the attribute name tag will be displayed on a display 102 (step SI 9). If there is nothing (it is NO at step 
SI 8), step SI 9 will be skipped. Next, if the contents output section 9 of a document investigated whether the unsettled 
attribute name index would remain on the notified attribute name index (step S20) and it remains in it, it will move 
attention to one of them (step S21), and will repeat the processing returned and mentioned above to step SI 7. 
[0054] After finishing the processing about all attribute name indexes in which the contents output section 9 of a 
document was notified (it is NO at step S20), the contents output-control section 10 of a document When investigating 
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whether the unsettled thing remains to the text file notified from the success text file name sorting section 8 (step S22) 
and remaining in it, move attention to one of them, and the contents of a document of the text file name are taken out. 
The contents output section 9 of a document is passed, and is made to process with all the attribute name indexes notified 
from the success text file name sorting section 8 (step S23). It becomes processing termination after the processing about 
all success files finishes (it is YES at step S22). 

[Translation done.] 
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JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original precisely. 
2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



EXAMPLE 

[Example] Suppose that three text file names, "a file 1", "a file 2", and "a file 3 M , and URL of those are registered into the 
text file term document 1 so that it may illustrate to drawing 1 . Moreover, the contents of the file 1 show drawing 5 (a), 
and the contents of the file 2 presuppose that it is what is shown in drawing 5 (b). These files 1 and 2 are XML. It is 
described and the attribute and attribute value other than a text text are included. That is, the pair of the tag which the 
attribute name expressing the semantics written into the document of a <hotel> X hotel «location> San Francisco 
</location> <price> $150 </price> <pet> possible < / pet> attached, and the value of the attribute is contained in the file 
1. Similarly, a tag called a <hotel> Z hotel «location> Seattle </location> <price> $180 </price> <pet> improper < / 
pet> is contained also in the file 2. 

[0056] Moreover, the pair of an attribute name which is illustrated to drawing 1 , and its natural language expression 
shall be stored in the attribute name storing dictionary 2 in advance, and 3 groups of attribute value and a natural 
language expression which are illustrated to drawing 1 , and an attribute name shall be stored in the attribute value 
storing dictionary 3 in advance. In addition, if all prices are registered into the attribute value storing dictionary 3 with 
the actual value, since a number of registration will increase, you may make it register using a variable. That is, XXX If 
it registers as it is shown in the following table 3, when considering as the numeric value of arbitration, and the keyword 
extraction section 4 has a "dollar" behind the numeric value of arbitration, it will be the natural language expression 
XXX. Let what judged that a dollar existed and attached $ to the head of the actual value which existed be attribute 
value. 

[Table 3] 

\i] ID=000005 



[0057] Actuation of this example is explained to an example for the case where a user inputs the retrieval input statement 
by the following natural language by such premise. 

Retrieval input-statement: "I want to find the information on the hotel which it is possible for a price to take a pet in 150 
dollar, and is moreover near San Francisco" 

[0058] The keyword extraction section 4 will change a retrieval input statement into a keyword train as follows with 
reference to the attribute name storing dictionary 2 and the attribute value storing dictionary 3, if the retrieval input 
statement from a user is received. 

[0059] First, since the natural language expression "a price" of the head of a retrieval input statement exists in the 
attribute name storing dictionary 2, the attribute name "a price" registered into it and a pair by becoming is outputted as 
an attribute name index. Next, since a natural language expression "150 Dollar" exists in the attribute value storing 
dictionary 3, the pair of the attribute value "$150" and the attribute name "the price" which are registered in it and 3 
groups is outputted as an attribute value index. Next, since a natural language expression "a pet" exists in the attribute 
name storing dictionary 2, the attribute name "a pet" registered into it and a pair by becoming is outputted as an attribute 
name index. Next, since a natural language expression "possible" exists in the attribute value storing dictionary 3, the 
pair of the attribute value "possible" and the attribute name "the pet" which are registered in it and 3 groups is outputted 
as an attribute value index. Next, since a natural language expression "San Francisco" exists in the attribute value storing 
dictionary 3, the pair of the attribute value "San Francisco" and the attribute name "the location" which are registered in 
it and 3 groups is outputted as an attribute value index. Next, since a "hotel" exists in the attribute name storing 
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dictionary 2, the attribute name "a hotel" registered into it and a pair by becoming is outputted as an attribute name 

index. Into a retrieval input statement, other natural language expressions which match the natural language expression 

registered into the attribute name storing dictionary 2 and the attribute value storing dictionary 3 cannot be found. 

Therefore, the following keyword trains are outputted sequentially from a top. 

[0060] Attribute name index (attribute name M a price") 

Attribute value index (attribute value "$150", attribute name M a price") 

Attribute name index (attribute name "a pet") 

Attribute value index (attribute value "possible", attribute name "a pet") 

Attribute value index (attribute value " "San Francisco", attribute name "a location") 

Attribute name index (attribute name "a hotel") 

[0061] Next, the keyword filter section 5 unifies a redundant part with reference to the order of a list of an attribute name 
index and an attribute value index. Since it has ranked with next doors by the attribute name "a price" with same attribute 
name index (attribute name "price") and attribute value index (attribute value "$150", attribute name "price") in the case 
of the upper keyword train, an attribute name index (attribute name "a price") is deleted. Moreover, since it has ranked 
with next doors by the attribute name "a pet" with same attribute name index (attribute name "pet") and attribute value 
index (attribute value "possible", attribute name "pet"), an attribute name index (attribute name "a pet") is deleted. Since 
the attribute name index which should otherwise be deleted does not exist, finally the above-mentioned keyword train is 
compressed as follows. 

[0062] (a) Attribute value index (attribute value "$150", attribute name "a price") 

(b) Attribute value index (attribute value "possible", attribute name "a pet") 

(c) Attribute value index (attribute value " "San Francisco", attribute name "a location") 

(d) Attribute name index (attribute name "a hotel") 

[0063] next, the success text file name sorting section 8 - the contents of a document of the file 1 in the text file term 
document 1 - the URL - reliance -- the Internet 103 from the server which leads and corresponds - acquiring -- the 
contents of a document, and the above-mentioned attribute value index (a) - (c) The contents Banking Inspection 
Department 7 of an integrated document is passed. 

[0064] The contents Banking Inspection Department 7 of an integrated document is the contents of a document of a file 
1, and one attribute value index (a). The contents Banking Inspection Department 6 of a document is passed. 
[0065] The contents Banking Inspection Department 6 of a document is an attribute value index (a) in the contents of a 
document of a file 1. It investigates whether the tag of an inner attribute name "a price" exists, drawing 5 - (-- a --) - a 
file - one - a case ~ corresponding - a tag - < - a price -->--$ - 150 - < - /-- a price - > - it is - since - the - 
attribute value - "-- $ - 150 - " - having received - attribute value ~ an index ~ (-- a --) It investigates whether it is in 
agreement with inner attribute value "$150." In the present example, since it is in agreement, success is returned to the 
contents Banking Inspection Department 7 of an integrated document. 

[0066] The contents Banking Inspection Department 7 of an integrated document is the contents of a document of a file 
1, and the following attribute value index (b). The contents Banking Inspection Department 6 of a document is passed. 
[0067] The contents Banking Inspection Department 6 of a document is an attribute value index (b) in the contents of a 
document of a file 1. It investigates whether the tag of an inner attribute name "a pet" exists, drawing 5 ~ (-- a --) a file 
- one - a case - corresponding - a tag — < - a pet - > - possible - < - /-- a pet - > - it is - since - the - attribute 
value - "-- possible - " - having received - attribute value ~ an index - (-- b --) It investigates whether it is in 
agreement with inner attribute value "possible." In the present example, since it is in agreement, success is returned to 
the contents Banking Inspection Department 7 of an integrated document. 

[0068] The contents Banking Inspection Department 7 of an integrated document is the contents of a document of a file 
1, and the following attribute value index (c). The contents Banking Inspection Department 6 of a document is passed. 
[0069] The contents Banking Inspection Department 6 of a document is an attribute value index (c) in the contents of a 
document of a file 1. It investigates whether the tag of an inner attribute name "a location" exists, drawing^ - (- a --) - 
a file - one - a case - corresponding - a tag - < - a location -- > -- San Francisco - < -- /-- a location - > - it is - 
since - the - attribute value — "-- San Francisco — " - having received — attribute value - an index -- (-- c ~) It 
investigates whether it is in agreement with inner attribute value "San Francisco." In the present example, since it is in 
agreement, success is returned to the contents Banking Inspection Department 7 of an integrated document. 
[0070] Since the result of success was obtained by all attribute value indexes about the file 1, the contents Banking 
Inspection Department 7 of an integrated document notifies success to the success text file name sorting section 8, and 
the success text file name sorting section 8 considers a file 1 as a success file. 

[0071] Next, the success text file name sorting section 8 is the Internet 103 to reliance about the URL in the contents of a 
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document of the file 2 stored in the text file term document 1 . It leads, and incorporates from the corresponding server 
and success or failure is judged using the contents Banking Inspection Department 7 of an integrated document like the 
previous file 1 . In this case, since a pet attribute and a location attribute are not satisfied, a file 2 serves as a rejection. 
Similarly, the judgment of success or failure is performed also about the remaining file 3. Here, a file 3 is also judged to 
be a rejection and a success file presupposes that it was it only a file 1 . 

[0072] Next, the success text file name sorting section 8 is an attribute name index (d) about a file name 1 as a success 
file name. The contents output-control section 10 of a document is passed. 

[0073] The contents output-control section 10 of a document acquires URL of a file name 1 from the text file term 
document 1, and is the Internet 103 to reliance about it. The upper server is accessed, the contents of a document of a file 
name 1 are acquired, and it is an attribute name index (d). The contents output section 9 of a document is passed. 
[0074] The contents output section 9 of a document is an attribute name index (d) in the contents of a document of a file 
1. It investigates whether an attribute tag with an attribute name "a hotel" exists, since the corresponding tag <hotel> X 
hotel <a /hotel> exists in the case of the file 1 of drawing 5 (a) - the attribute value "X hotel" taking out - a file name 
1 - display 102 It displays. 

[0075] Drawing 4 is the block diagram showing the example of a hardware configuration of the text file retrieval 
equipment of this invention. The text file retrieval equipment of this example A processor 200 (CPU), The auxiliary 
storage units 201, such as a magnetic disk drive, and the interface 202 of those, The memory 203, such as RAM, and the 
interface 204 between the Internet 103, The record medium 205 which can machine read CD-ROM, semiconductor 
memory, etc., It consists of buses 209 which connect between the interface 206, an input unit 101, its interface 207, a 
display 102, its interface 208, and CPU200, memory 203 and an interface 202,204,206-208. 

[0076] The program for text file retrieval is recorded on the record medium 205, and the text file term document 1, the 
attribute name storing dictionary 2, and the attribute value storing dictionary 3 of drawing 1 are loaded on memory 203 
or an auxiliary storage unit 20 1 by installing this program through an interface 206. Moreover, this program operates 
CPU200 by controlling actuation of CPU200 as the keyword extraction section 4 of drawing 1 , the keyword filter 
section 5, the contents Banking Inspection Department 6 of a document, the contents Banking Inspection Department 7 
of an integrated document, the success text file name sorting section 8, the contents output section 9 of a document, and 
the contents output-control section 10 of a document. 

[0077] The gestalt of the above operation is WWW about this invention. Although applied to retrieval of the upper 
homepage, it is applicable also to the retrieval to a database like the technique indicated to JP,5-67136,A. 



[Translation done.] 
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DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawin g 1] It is the block diagram of the text file retrieval equipment of the gestalt of operation of this invention. 
[Drawing 2] It is the flow chart which shows the example of text file retrieval equipment processing of the gestalt of 
operation of this invention. 

[Drawing 3] It is the flow chart which shows the example of text file retrieval equipment processing of the gestalt of 
operation of this invention. 

[Drawing 4] It is the block diagram showing the example of a hardware configuration of the text file retrieval equipment 
of this invention. 

[Drawing 5] XML It is drawing showing the example of description of the used text file. 
[Drawing 6] It is drawing showing the example of description of HTML. 

[Drawing 7] It is drawing showing the example which displayed the example of description of HTML of drawin g 6 by 
the browser. 

[Description of Notations] 

1 — a text file term document and 2 - an attribute name storing dictionary and 3 — an attribute value storing dictionary 
and 4 — the keyword extraction section and 5 — the keyword filter section and 6 — the contents Banking Inspection 
Department of a document, and 7 — the contents Banking Inspection Department of an integrated document, and 8 - the 
success text file name sorting section and 9 the contents output section of a document, and 10 — the contents output- 
control section of a document — it comes out. 



[Translation done.] 
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